PATENT 

Attorney Docket No. FORS-04944 
Express Mail Label No: EL 658 777 545 US 

DETECTION OF RNA 

The present application is a continuation-in-part of U.S. Patent Application No. 
09/577,304 and 09/758,282 and is a continuation-in-part of U.S. Patent Application No. 
5 09/350,309, which is a divisional application U.S. Patent No. 5,985,557, and is also a 
continuation-in-part of co-pending application serial no. 09/381,212 which is a national 
entry of PCT application no. US 98/05809, which claims priority to U.S. Patent Nos. 
5,994,069, 6,090,543, 5,985,557, 6,001,567, and 5,846,717 and PCT appHcation no. US 
97/01072. 

10 

FIELD OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
1 5 target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. 
For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 
used to cleave the target-dependent cleavage structure, thereby indicating the presence 
ofspecific nucleic acid sequences or specific variations thereof, 

20 BACKGROUND OF THE INVENTION 

Methods for the detection and characterization ofspecific nucleic acid sequences 
and sequence variations have been used to detect the presence of viral or bacterial nucleic 
acid sequences indicative of an infection, to detect the presence of variants or alleles of 
genes associated with disease and cancers. These methods also find application in the 
25 identification of sources of nucleic acids, as for forensic analysis or for paternity 
determinations. 

Various methods are known to the art that may be used to detect and characterize 
specific nucleic acid sequences and sequence variants. Nonetheless, with the completion 
of the nucleic acid sequencing of the human genome, as well as the genomes of numerous 
30 pathogenic organisms, the demand for fast, reliable, cost-effective and user-friendly tests 
for the detection of specific nucleic acid sequences continues to grow. Importantly, these 
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tests must be able to create a detectable signal from samples that contain very few copies 
of the sequence of interest. The following discussion examines two levels of nucleic acid 
detection assays currently in use: L Signal Amplification Technology for detection of 
rare sequences 11. Direct Detection Technology for quantitative detection of sequences, 
5 and III. Direct Detection of RNA. 

I. Signal Amplification Technology Methods For Amplification 

The "Polymerase Chain Reaction" (PCR) comprises the first generation of 
methods for nucleic acid amplification. However, several other methods have been 
10 developed that employ the same basis of specificity, but create signal by different 

ampUfication mechanisms. These methods include the "Ligase Chain Reaction" (LCR), 
"Self-Sustained Synthetic Reaction" (3SR/NASBA), and "Qp-Rephcase" (QP). 

Polymerase Chain Reaction (PCR) 

1 5 The polymerase chain reaction (PCR), as described in U.S. Patent Nos. 4,683, 195, 

4,683,202, and 4,965,188 to MuUis and MuUis et al (the disclosures of which are hereby 
incorporated by reference), describe a method for increasing the concentration of a 
segment of target sequence in a mixture of genomic DNA without cloning or purification. 
This technology provides one approach to the problems of low target sequence 

20 concentration. PCR can be used to directly increase the concentration of the target to an 
easily detectable level. This process for amplifying the target sequence involves 
introducing a molar excess of two oligonucleotide primers that are complementary to 
their respective strands of the double-stranded target sequence to the DNA mixture 
containing the desired target sequence. The mixture is denatured and then allowed to 

25 hybridize. Following hybridization, the primers are extended with polymerase so as to 
form complementary strands. The steps of denaturation, hybridization, and polymerase 
extension can be repeated as often as needed, in order to obtain relatively high 
concentrations of a segment of the desired target sequence. 

The length of the segment of the desired target sequence is determined by the 

30 relative positions of the primers with respect to each other, and, therefore, this length is a 
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controllable parameter. Because the desired segments of the target sequence become the 
dominant sequences (in terms of concentration) in the mixture, they are said to be 
"PCR-amphfied/' 



Ligase Chain Reaction (LCR or LAR) 

5 The ligase chain reaction (LCR; sometimes referred to as "Ligase Amplification 

Reaction" (LAR) described by Barany, Proc. Natl Acad, Sci., 88:189 (1991); Barany, 
PGR Methods and Applic, 1:5 (1991); and Wu and Wallace, Genomics 4:560 (1989) has 
developed into a well-recognized alternative method for amplifying nucleic acids. In 
LCR, four oHgonucleotides, two adjacent oligonucleotides that imiquely hybridize to one 
10 strand of target DNA, and a complementary set of adjacent oligonucleotides, that 
2f hybridize to the opposite strand are mixed and DNA ligase is added to the mixture. 

£ Provided that there is complete complementarity at the junction, ligase will covalently 

f; „" link each set of hybridized molecules. Importantly, in LCR, two probes are ligated 

together only when they base-pair with sequences in the target sample, without gaps or 
Q 15 mismatches. Repeated cycles of denaturation, hybridization and ligation amplify a short 
J I segment of DNA. LCR has also been used in combination with PCR to achieve enhanced 

|! detection of single-base changes, Segev, PCT Public. No. W09001069 Al (1990). 

However, because the four oligonucleotides used in this assay can pair to form two short 
Ugatable fragments, there is the potential for the generation of target-independent 
20 background signal. The use of LCR for mutant screening is hmited to the examination of 
specific nucleic acid positions. 

Self-Sustained Synthetic Reaction (3SR/NASBA) 

The self-sustained sequence replication reaction (3SR) (Guatelli et al, Proc, Natl. 
Acad. Sci., 87:1874-1878 [1990], with an erratum at Proc, Natl. Acad, Sci., 87:7797 

25 [1990]) is a transcription-based in vitro amphfication system (Kwok et al, Proc. Natl. 
Acad. Sci., 86:1 173-1 177 [1989]) that can exponentially amplify RNA sequences at a 
uniform temperature. The amplified RNA caa then be utihzed for mutation detection 
(Fahy et al, PCR Meth. AppL, 1 :25 [1991]), In this method, an oligonucleotide primer is 
used to add a phage RNA polymerase promoter to the 5' end of the sequence of interest. 

30 In a cocktail of enzymes and substrates that includes a second primer, reverse 
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transcriptase, RNase H, RNA polymerase and ribo-and deoxyribonucleoside 
triphosphates, the target sequence undergoes repeated rounds of transcription, cDNA 
synthesis and second-strand synthesis to ampUfy the area of interest. The use of 3SR to 
detect mutations is kinetically hmited to screening small segments of DNA (e.g., 200-300 
5 base pairs), 

Q-Beta (Qp) Replicase 

In this method, a probe that recognizes the sequence of interest is attached to the 
replicatable RNA template for Qp replicase. A previously identified major problem with 
false positives resulting from the replication of unhybridized probes has been addressed 

10 through use of a sequence-specific ligation step. However, available thermostable DNA 
ligases are not effective on this RNA substrate, so the Ugation must be performed by T4 
DNA Hgase at low temperatures (37°C), This prevents the use of high temperature as a 
means of achieving specificity as in the LCR, the ligation event can be used to detect a 
mutation at the junction site, but not elsewhere. 

15 Table 2 below, lists some of the features desirable for systems useful in sensitive 

nucleic acid diagnostics, and summarizes the abilities of each of the major amplification 
methods (See also, Landgren, Trends in Genetics 9:199 [1993]). 

A successful diagnostic method must be very specific. A straight-forward method 
of controlling the specificity of nucleic acid hybridization is by controlling the 

20 temperature of the reaction. While the 3SR/NASBA, and QP systems are all able to 
generate a large quantity of signal, one or more of the enzymes involved in each cannot 
be used at high temperature (i.e., >55'^C). Therefore the reaction temperatures cannot be 
raised to prevent non-specific hybridization of the probes. If probes are shortened in 
order to make them melt more easily at low temperatures, the likelihood of having more 

25 than one perfect match in a complex genome increases. For these reasons, PGR and LCR 
currently dominate the research field in detection technologies. 
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TABLE 1 



Feature 


Method 


PGR LCR PCR& 3SR QP 
LCR NASBA 


Amplifies Target 


+ 


+ 


+ 


+ 




Recognition of Independent 
Sequences Required 


+ 


+ 






+ 


Performed at High Temp. 




+ 








Operates at Fixed Temp. 








■f 


+ 


Exponential Amplification 


+ 


+ 








Generic Signal Generation 










+ 


Easily Automatable 













The basis of the amplification procedure in the PGR and LCR is the fact that the 
products of one cycle become usable templates in all subsequent cycles, consequently 
5 doubling the population with each cycle. The final yield of any such doubhng system can 
be expressed as: (1+X)" = y, where "X" is the mean efficiency (percent copied in each 
cycle), "n" is the number of cycles, and "y" is the overall efficiency, or yield of the 
reaction (Mullis, PGR Methods AppUc, 1 : 1 [1 99 1 ]). If every copy of a target DNA is 
utilized as a template in every cycle of a polymerase chain reaction, then the mean 
1 0 efficiency is 1 00%. If 20 cycles of PGR are performed, then the yield will be 2^^, or 
1,048,576 copies of the starting material. If the reaction conditions reduce the mean 
efficiency to 85%, then the yield in those 20 cycles will be only 1.85^^, or 220,513 copies 
of the starting material. In other words, a PGR running at 85% efficiency will yield only 
21% as much final product, compared to a reaction running at 100% efficiency. A 
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reaction that is reduced to 50% mean efficiency will yield less than 1% of the possible 
product. 

In practice, routine polymerase chain reactions rarely achieve the theoretical 
maximum yield, and PCRs are usually run for more than 20 cycles to compensate for the 
5 lower yield. At 50% mean efficiency, it would take 34 cycles to achieve the miUion-foId 
amplification theoretically possible in 20, and at lower efficiencies, the number of cycles 
required becomes prohibitive, hi addition, any background products that amplify with a 
better mean efficiency than the intended target will become the dominant products. 

Also, many variables can influence the mean efficiency of PGR, including target 

10 DNA length and secondary structure, primer length and design, primer and dNTP 
concentrations, and buffer composition, to name but a few. Contamination of the 
reaction with exogenous DNA {e,g., DNA spilled onto lab surfaces) or cross- 
contamination is also a major consideration. Reaction conditions must be carefully 
optimized for each different primer pair and target sequence, and the process can take 

1 5 days, even for an experienced investigator. The laboriousness of this process, including 
numerous technical considerations and other factors, presents a significant drawback to 
using PGR in the clinical setting. Indeed, PGR has yet to penetrate the chnical market in 
a significant way. The same concerns arise with LGR, as LCR must also be optimized to 
use different oligonucleotide sequences for each target sequence. In addition, both 

20 methods require expensive equipment, capable of precise temperature cycling. 

Many applications of nucleic acid detection technologies, such as in studies of 
allelic variation, involve not only detection of a specific sequence in a complex 
background, but also the discrimination between sequences with few, or single, 
nucleotide differences. One method for the detection of allele-specific variants by PGR is 

25 based upon the fact that it is difficult for Tag polymerase to synthesize a DNA strand 
when there is a mismatch between the template strand and the 3' end of the primer. An 
allele-specific variant may be detected by the use of a primer that is perfectly matched 
with only one of the possible alleles; the mismatch to the other allele acts to prevent the 
extension of the primer, thereby preventing the amplification of that sequence. This 

30 method has a substantial limitation in that the base composition of the mismatch 

influences the abiUty to prevent extension across the mismatch, and certain mismatches 
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do not prevent extension or have only a minimal effect (Kwok et al, NucL Acids Res,, 
18:999 [1990]).) 

A similar 3*-mismatch strategy is used with greater effect to prevent hgation in the 
LCR(Barany,PCRMeth. Applic, 1:5 [1991]). Any mismatch effectively blocks the 
5 action of the thermostable ligase, but LCR still has the drawback of target-independent 
background hgation products initiating the amphfication. Moreover, the combination of 
PGR with subsequent LCR to identify the nucleotides at individual positions is also a 
clearly cumbersome proposition for the clinical laboratory. 



10 II. Direct Detection Technology 

When a sufficient amount of a nucleic acid to be detected is available, there are 
advantages to detecting that sequence directly, instead of making more copies of that 
target, (e.g., as in PCR and LCR). Most notably, a method that does not amplify the 
signal exponentially is more amenable to quantitative analysis. Even if the signal is 

15 enhanced by attaching multiple dyes to a single oligonucleotide, the correlation between 
the final signal intensity and amount of target is direct. Such a system has an additional 
advantage that the products of the reaction will not themselves promote further reaction, 
so contamination of lab surfaces by the products is not as much of a concern. Traditional 
methods of direct detection including Northern and Southern blotting and RNase 

20 protection assays usually require the use of radioactivity and are not amenable to 
automation. Recently devised techniques have sought to ehminate the use of 
radioactivity and/or improve the sensitivity in automatable formats. Two examples are 
the "Cycling Probe Reaction" (CPR), and "Branched DNA" (bDNA) 

The cycling probe reaction (CPR) (Duck et al, BioTech., 9:142 [1990]), uses a 

25 long chimeric oligonucleotide in which a central portion is made of RNA while the two 
termini are made of DNA. Hybridization of the probe to a target DNA and exposure to a 
thermostable RNase H causes the RNA portion to be digested. This destabiUzes the 
remaining DNA portions of the duplex, releasing the remainder of the probe from the 
target DNA and allowing another probe molecule to repeat the process. The signal, in the 

30 form of cleaved probe molecules, accumulates at a hnear rate. While the repeating 
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process increases the signal, the RNA portion of the oligonucleotide is vulnerable to 
RNases that may be carried through sample preparation. 

Branched DNA (bDNA), described by Urdea et al. Gene 61 :253-264 (1987), 
involves oUgonucleotides with branched structures that allow each individual 
oligonucleotide to carry 35 to 40 labels {e.g., alkaline phosphatase enzymes). While this 
enhances the signal from a hybridization event, signal from non-specific binding is 
similarly increased. 

While both of these methods have the advantages of direct detection discussed 
above, neither the CPR or bDNA methods can make use of the specificity allowed by the 
requirement of independent recognition by two or more probe (oligonucleotide) 
sequences, as is common in the signal amplification methods described in Section L 
above. The requirement that two oligonucleotides must hybridize to a target nucleic acid 
in order for a detectable signal to be generated confers an extra measure of stringency on 
any detection assay. Requiring two oligonucleotides to bind to a target nucleic acid 
reduces the chance that false "positive" results will be produced due to the non-specific 
binding of a probe to the target. The further requirement that the two ohgonucleotides 
must bind in a specific orientation relative to the target, as is required in PCR, where 
ohgonucleotides must be oppositely but appropriately oriented such that the DNA 
polymerase can bridge the gap between the two oUgonucleotides in both directions, 
further enhances specificity of the detection reaction. However, it is well known to those 
in the art that even though PCR utilizes two oligonucleotide probes (termed primers) 
"non-specific" amplification {i.e., amplification of sequences not directed by the two 
primers used) is a common artifact. This is in part because the DNA polymerase used in 
PCR can accommodate very large distances, measured in nucleotides, between the 
oligonucleotides and thus there is a large window in which non-specific binding of an 
ohgonucleotide can lead to exponential amplification of inappropriate product. The 
LCR, in contrast, cannot proceed unless the oligonucleotides used are bound to the target 
adjacent to each other and so the fiill benefit of the dual ohgonucleotide hybridization is 
realized. 
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An ideal direct detection method would combine the advantages of the direct 
detection assays (e.g., easy quantification and minimal risk of carry-over contamination) 
with the specificity provided by a dual oligonucleotide hybridization assay. 



5 III. Direct Detection of RNA 

In molecular medicine, a simple and cost-effective method for direct and 
quantitative RNA detection would greatly facilitate the analysis of RNA viruses and the 
measurement of specific gene expression. Both of these issues are currently pressing 
problems in the field. Despite this need, few techniques have emerged that are truly 
1 0 direct. PCR-based detection assays require conversion of RNA to DNA by reverse 
^ transcriptase before amplification, introducing a variable that can compromise accurate 

p3 quantification. Furthermore, PCR and other methods based on exponential amplification 

% (e.g. , NASBA) require painstaking containment measures to avoid cross-contamination, 

P and have difficulty distinguishing small differences (e.g., 2 to 3 -fold) in quantity. Other 

in 15 tests that directly examine RNA suffer fi*om a variety of drawbacks, including time 
1., consuming autoradiography steps (e.g. , RNase protection assays), or overnight reaction 

U1 times (e.g., branched DNA assays). With over 1.5 million viral load measurements being 

jj; performed in the U.S. every year, there is clearly an enormous potential for an 

f f inexpensive, rapid, high-throughput system for the quantitative measurement of RNA. 

20 Techniques for direct, quantitative detection of mRNA are vital for monitoring 

expression of a number of different genes. In particular, levels of cytokine expression 
(e.g., interleukins and lymphokines) are being exploited as cUnical measures of immune 
response in the progression of a wide variety of diseases (Van Deuren et al, J. Int. Fed. 
Clin. Chem., 5:216 [1993], Van Deuren et al, J. Inf. Dis., 169:157 [1994], Perenboom et 
25 a/., Eur. J. Clin. Invest., 26:159 [1996], Guidotti et al. Immunity 4:25 [1996]) as well as 
in monitoring transplant recipients (Grant et al. Transplantation 62:910 [1996]). 
Additionally, the monitoring of viral load and identification of viral genotype have great 
clinical significance for individuals suffering viral infections by such pathogens as HIV 
or Hepatitis C virus (HCV). There is a high correlation between viral load (i.e., the 
30 absolute number of viral particles in the bloodstream) and time to progression to AIDS 
(Mellors et al, Science 272:1167 [1996], Saag et al. Nature Medicine 2:625 [1996]). 
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For that reason, viral load, as measured by quantitative nucleic acid based testing, is 
becoming a standard monitoring procedure for evaluating the efficacy of treatment and 
the clinical status of HIV positive patients. It is thought to be essential to reduce viral 
load as early in the course of infection as possible and to evaluate viral levels on a regular 
5 basis. In the case of HCV, viral genotype has great clinical significance, with 
correlations to severity of liver disease and responsiveness to interferon therapy. 
Furthermore, because HCV cannot be grown in culture, it is only by establishing 
correlations between characteristics hke viral genotype and chnical outcome that new 
antiviral treatments can be evaluated. 

1 0 While the above mentioned methods have been serviceable for low throughput, 

research apphcations, or for limited clinical apphcation, it is clear that large scale 
quantitative analysis of RNA readily adaptable to any genetic system will require a more 
innovative approach. An ideal direct detection method would combine the advantages of 
the direct detection assays (e.g., easy quantification and minimal risk of carry-over 

15 contamination) with the specificity provided by a dual oligonucleotide hybridization 
assay. 

Many of the methods described above rely on hybridization alone to distinguish a 
target molecule from other nucleic acids. Although some of these methods can be highly 
sensitive, they often cannot quantitate and distinguish closely related mRNAs accurately, 
20 especially such RNAs expressed at different levels in the same sample. While the 
above-mentioned methods are serviceable for some purposes, a need exists for a 
technology that is particularly adept at distinguishing particular RNAs from closely 
related molecules. 



25 SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner, 
30 For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 
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used to cleave the target-dependent cleavage structure, thereby indicating the presence of 
specific nucleic acid sequences or specific variations thereof. 

The present invention provides structure-specific cleavage agents (e.g., nucleases) 
from a variety of sources, including mesophihc, psychrophiHc, thermophilic, and 
5 hyperthermophilic organisms. The preferred structure-specific nucleases are 

thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
usefiil in that they operate at temperatures where nucleic acid hybridization is extremely 
specific, allowing for allele-specific detection (including single-base mismatches). In one 
embodiment, the thermostable structure-specific nucleases are thermostable 5' nucleases 
10 comprising altered polymerases derived fi*om the native polymerases of Thermus 
^ species, including, but not limited to Thermus aquaticus, Thermus flavus, and Thermus 

€J thermophilus. However, the invention is not limited to the use of thermostable 5' 

m nucleases. Thermostable structure-specific nucleases fi:om the FEN-1, RAD2 and XPG 

j class of nucleases are also preferred. 

W 1 5 The present invention provides a metiiod for detecting a target sequence (e.g., a 

7 ^ mutation, polymorphism, etc), comprising providing a sample suspected of containing the 

O target sequence; oligonucleotides capable of forming an invasive cleavage structure in the 

presence of the target sequence; and an agent for detecting the presence of an invasive 
j; cleavage structure; and exposing the sample to the oligonucleotides and the agent. In 

^ 20 some embodiments, the method further comprises the step of detecting a complex 

comprising the agent and the invasive cleavage structure (directly or indirectly). In some 
embodiments, the agent comprises a cleavage agent. In some preferred embodiments, the 
exposing of the sample to the oligonucleotides and the agent comprises exposing the 
sample to the oligonucleotides and the agent under conditions wherein an invasive 
25 cleavage structure is formed between the target sequence and the ohgonucleotides if the 
target sequence is present in the sample, wherein the invasive cleavage structure is 
cleaved by the cleavage agent to form a cleavage product. In some embodiments, the 
method further comprises the step of detecting the cleavage product. In some 
embodiments, the target sequence comprises a first region and a second region, the 
30 second region downstream of and contiguous to the first region, and wherein the 

oligonucleotides comprise first and second oligonucleotides, wherein at least a portion of 
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the first oligonucleotide is completely complementary to the first portion of the target 
sequence and wherein the second oligonucleotide comprises a 3' portion and a 5' portion, 
wherein the 5' portion is completely complementary to the second portion of said target 
nucleic acid. 

5 The present invention also provides a kit for detecting such target sequences, said 

kit comprising ohgonucleotides capable of forming an invasive cleavage structure in the 
presence of the target sequence. In some embodiments, the kit fiirther comprises an agent 
for detecting the presence of an invasive cleavage structure (e.g., a cleavage agent). In 
some embodiments, the ohgonucleotides comprise first and second oligonucleotides, said 
10 first oUgonucleotide comprising a 5' portion complementary to a first region of the target 
nucleic acid and said second oligonucleotide comprising a 3' portion and a 5' portion, said 
5' portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some preferred embodiments, the target sequence 
comprises 

1 5 The present invention also provides methods for detecting the presence of a target 

nucleic acid molecule by detecting non-target cleavage products comprising providing: a 
cleavage agent; a source of target nucleic acid, the target nucleic acid comprising a first 
region and a second region, the second region downstream of and contiguous to the first 
region; a first oligonucleotide, wherein at least a portion of the first oligonucleotide is 

20 completely complementary to the first portion of the target nucleic acid; and a second 
oUgonucleotide comprising a 3' portion and a 5' portion, wherein the 5' portion is 
completely complementary to the second portion of the target nucleic acid; mixing the 
cleavage agent, the target nucleic acid, the first oligonucleotide and the second 
oligonucleotide to create a reaction mixture under reaction conditions such that at least 

25 the portion of the first oligonucleotide is annealed to the first region of said target nucleic 
acid and wherein at least the 5' portion of the second oligonucleotide is annealed to the 
second region of the target nucleic acid so as to create a cleavage structure, and wherein 
cleavage of the cleavage structure occurs to generate non-target cleavage product; and 
detecting the cleavage of the cleavage structure. 

30 The detection of the cleavage of the cleavage structure can be carried out in any 

manner. In some embodiments, the detection of the cleavage of the cleavage structure 
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comprises detecting the non-target cleavage product. In yet other embodiments, the 
detection of the cleavage of the cleavage structure comprises detection of fluorescence, 
mass, or fluorescence energy transfer. Other detection methods include, but are not 
limited to detection of radioactivity, luminescence, phosphorescence, fluorescence 
5 polarization, and charge, hi some embodiments, detection is carried out by a method 
comprising providing the non-target cleavage product; a composition comprising two 
single-stranded nucleic acids annealed so as to define a single-stranded portion of a 
protein binding region; and a protein; and exposing the non-target cleavage product to the 
single-stranded portion of the protein binding region under conditions such that the 

10 protein binds to the protein binding region. In some embodiments, the protein comprises 
a nucleic acid producing protein, wherein the nucleic acid producing protein binds to the 
protein-binding region and produces nucleic acid. In some embodiments, the protein- 
binding region is a template-dependent RNA polymerase binding region (e.g., a T7 RNA 
polymerase binding region). In other embodiments, the detection is carried out by a 

1 5 method comprising providing the non-target cleavage product; a single continuous strand 
of nucleic acid comprising a sequence defining a single strand of an RNA polymerase 
binding region; a template-dependent DNA polymerase; and a template-dependent RNA 
polymerase; exposing the non-target cleavage product to the RNA polymerase binding 
region under conditions such that the non-target cleavage product binds to a portion of 

20 the single strand of the RNA polymerase binding region to produce a bound non-target 
cleavage product; exposing the bound non-target cleavage product to the template- 
dependent DNA polymerase under conditions such that a double-stranded RNA 
polymerase binding region is produced; and exposing the double-stranded RNA 
polymerase binding region to the template-dependent RNA polymerase under conditions 

25 such that RNA transcripts are produced. In some embodiments, the method fiirther 

comprises the step of detecting the RNA transcripts. In some embodiments, the template- 
dependent RNA polymerase is T7 RNA polymerase. 

The present invention is not limited by the nature of the 3' portion of the second 
oUgonucleotide. In some preferred embodiments, the 3' portion of the second 

30 oUgonucleotide comprises a 3' terminal nucleotide not complementary to the target 
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nucleic acid. In some embodiments, the 3' portion of the second ohgonucleotide consists 
of a single nucleotide not complementary to the target nucleic acid. 

Any of the components of the method may be attached to a solid support. For 
example, in some embodiments, the first oligonucleotide is attached to a sohd support. In 
5 other embodiments, the second oligonucleotide is attached to a solid support. 

The cleavage agent can be any agent that is capable of cleaving invasive cleavage 
structures. In some preferred embodiments, the cleavage agent comprises a structure- 
specific nuclease. In particularly preferred embodiments, the structure-specific nuclease 
comprises a thermostable structure-specific nuclease (e.g., a thermostable 5' nuclease). 
10 Thermostable structure-specific nucleases include, but are not Umited to, those having an 
% amino acid sequence homologous to a portion of the amino acid sequence of a 

thermostable DNA polymerase derived from a thermophihc organism (e.g., Thermus 
i aquaticus, Thermus flavus, and Thermus thermophihis). In other embodiments, the 

f : thermostable structure-specific nuclease comprises a nuclease fromthe FEN-1 , RAD2 or 

01 1 5 XPG classes of nucleases, or chimerical structures containing one or more portions of any 
□ of the above cleavage agents. 

y I The method is not limited by the nature of the target nucleic acid. In some 

4^ embodiments, the target nucleic acid is single stranded or double stranded DNA or RNA, 

2 In some embodiments, double stranded nucleic acid is rendered single stranded (e.g., by 
20 heat) prior to formation of the cleavage structure. In some embodiment, the source of 

target nucleic acid comprises a sample containing genomic DNA. Sample include, but 
are not limited to, blood, saUva, cerebral spinal fluid, pleural fluid, milk, lymph, sputum 
and semen. 

In some embodiments, the reaction conditions for the method comprise providing 
25 a source of divalent cations. In some preferred embodiments, the divalent cation is 
selected from the group comprising Mn2+ and Mg2+ ions. In some embodiments, the 
reaction conditions for the method comprise providing the first and the second 
oligonucleotides in concentration excess compared to the target nucleic acid. 
In some embodiments, the method fixrther comprises providing a third 
30 oligonucleotide complementary to a third portion of said target nucleic acid upstream of 
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the first portion of the target nucleic acid, wherein the third oligonucleotide is mixed with 
the reaction mixture. 

The present invention also provides a method for detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising 
5 providing: a cleavage agent; a source of target nucleic acid, the target nucleic acid 
comprising a first region and a second region, the second region downstream of and 
contiguous to the first region; a plurality of first oligonucleotides, wherein at least a 
portion of the first oUgonucleotides is completely complementary to the first portion of 
the target nucleic acid; a second oligonucleotide comprising a 3' portion and a 5' portion, 

1 0 wherein said 5' portion is completely complementary to the second portion of the target 
nucleic acid; mixing the cleavage agent, the target nucleic acid, the plurahty of first 
oligonucleotides and second oligonucleotide to create a reaction mixture under reaction 
conditions such that at least the portion of a first oUgonucleotide is annealed to the first 
region of the target nucleic acid and wherein at least the 5' portion of the second 

15 oUgonucleotide is annealed to the second region of the target nucleic acid so as to create a 
cleavage structure, and wherein cleavage of the cleavage structure occurs to generate 
non-target cleavage product, wherein the conditions permit multiple cleavage structures 
to form and be cleaved from the target nucleic acid; and detecting the cleavage of said 
cleavage structures. In some embodiments, the conditions comprise isothermal 

20 conditions that permit the plurality of first oUgonucleotides to dissociate from the target 
nucleic acid. While the present invention is limited by the nimiber of cleavage structure 
formed on a particular target nucleic acid, in some preferred embodiments, two or more 
(3, 4, 5, . . ., 10, . . 10000, . . .) of the plurality of first oUgonucleotides form cleavage 
structures with a particular target nucleic acid, wherein the cleavage structures are 

25 cleaved to produce the non- target cleavage products. 

The present invention also provides methods wherein a cleavage product from the 
above methods is used in a fiirther invasive cleavage reaction. For example, the present 
invention provides a method comprising providing a cleavage agent; a first target nucleic 
acid, the first target nucleic acid comprising a first region and a second region, the second 

30 region downstream of and contiguous to the first region; a first oligonucleotide, wherein 
at least a portion of the first oligonucleotide is completely complementary to the first 
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portion of the first target nucleic acid; a second oligonucleotide comprising a 3' portion 
and a 5' portion, wherein the 5' portion is completely complementary to the second 
portion of the first target nucleic acid; a second target nucleic acid, said second target 
nucleic acid comprising a first region and a second region, the second region downstream 
5 of and contiguous to the first region; and a third oligonucleotide, wherein at least a 
portion of the third ohgonucleotide is completely complementary to the first portion of 
the second target nucleic acid; generating a first cleavage structure wherein at least said 
portion of the first oligonucleotide is annealed to the first region of the first target nucleic 
acid and wherein at least the 5' portion of the second ohgonucleotide is annealed to the 

10 second region of the first target nucleic acid and wherein cleavage of the first cleavage 
structure occurs via the cleavage agent thereby cleaving the first ohgonucleotide to 
generate a fourth ohgonucleotide, said fourth oligonucleotide comprising a 3* portion and 
a 5' portion, wherein the 5' portion is completely complementary to the second portion of 
the second target nucleic acid; generating a second cleavage structure under conditions 

1 5 wherein at least said portion of the third oligonucleotide is annealed to the first region of 
the second target nucleic acid and wherein at least the 5' portion of the fourth 
oligonucleotide is annealed to the second region of the second target nucleic acid and 
wherein cleavage of the second cleavage structure occurs to generate a cleavage 
fi-agment; and detecting the cleavage of the second cleavage structure. In some preferred 

20 embodiments, the 3' portion of the fourth ohgonucleotide comprises a 3' terminal 

nucleotide not complementary to the second target nucleic acid. In some embodiments, 
the 3' portion of the third oligonucleotide is covalently linked to the second target nucleic 
acid. In some embodiments, the second target nucleic acid further comprises a 5' region, 
wherein the 5' region of the second target nucleic acid is the third oligonucleotide. 

25 The present invention fiirther provides kits comprising: a cleavage agent; a first 
ohgonucleotide comprising a 5' portion complementary to a first region of a target 
nucleic acid; and a second ohgonucleotide comprising a 3' portion and a 5' portion, said 5' 
portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some embodiments, the 3' portion of the second 

30 oligonucleotide comprises a 3' termhial nucleotide not complementary to the target 
nucleic acid. In preferred embodiments, the 3' portion of the second oligonucleotide 
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consists of a single nucleotide not complementary to the target nucleic acid. In some 
embodiments, the kit further comprises a solid support. For example, in some 
embodiments, the first and/or second oUgonucleotide is attached to said solid support. In 
some embodiments, the kit further comprises a buffer solution. In some preferred 
5 embodiments, the buffer solution comprises a source of divalent cations (e.g., Mn2+ 
and/or Mg2+ ions). In some specific embodiments, the kit fiirther comprises a third 
oligonucleotide complementary to a third portion of the target nucleic acid upstream of 
the first portion of the first target nucleic acid. In yet other embodiments, the kit further 
comprises a target nucleic acid. In some embodiments, the kit further comprises a second 

1 0 target nucleic acid. In yet other embodiments, the kit further comprises a third 

oligonucleotide comprising a 5' portion complementary to a first region of the second 
target nucleic acid. In some specific embodiments, the 3' portion of the third 
oligonucleotide is covalently linked to the second target nucleic acid. In other specific 
embodiments, the second target nucleic acid fiirther comprises a 5' portion, wherein the 5' 

15 portion of the second target nucleic acid is the third oUgonucleotide. In still other 
embodiments, the kit further comprises an ARRESTOR molecule (e.g., ARRESTOR 
oligonucleotide). 

The present invention further provides a composition comprising a cleavage 
structure, the cleavage structure comprising: a) a target nucleic acid, the target nucleic 

20 acid having a first region, a second region, a third region and a fourth region, wherein the 
first region is located adjacent to and downstream from the second region, the second 
region is located adjacent to and downstream from the third region and the third region is 
located adjacent to and downstream from the fourth region; b) a first oligonucleotide 
complementary to the fourth region of the target nucleic acid; c) a second oligonucleotide 

25 having a 5' portion and a 3' portion wherein the 5' portion of the second oligonucleotide 
contains a sequence complementary to the second region of the target nucleic acid and 
wherein the 3' portion of the second oligonucleotide contains a sequence complementary 
to the third region of the target nucleic acid; and d) a third oUgonucleotide having a 5' 
portion and a 3' portion wherein the 5' portion of the third oligonucleotide contains a 

30 sequence complementary to the first region of the target nucleic acid and wherein the 3' 
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portion of the third ohgonucleotide contains a sequence complementary to the second 
region of the target nucleic acid. 

The present invention is not limited by the length of the four regions of the target 
nucleic acid. In one embodiment, the first region of the target nucleic acid has a length of 
5 1 1 to 50 nucleotides. In another embodiment, the second region of the target nucleic acid 
has a length of one to three nucleotides. In another embodiment, the third region of the 
target nucleic acid has a length of six to nine nucleotides. In yet another embodiment, the 
fourth region of the target nucleic acid has a length of 6 to 50 nucleotides. 

The invention is not limited by the nature or composition of the of the first, 

10 second, third and fourth oligonucleotides; these ohgonucleotides may comprise DNA, 

RNA, PNA and combinations thereof as well as comprise modified nucleotides, universal 
bases, adducts, etc. Further, one or more of the first, second, third and the fourth 
oligonucleotides may contain a dideoxynucleotide at the 3' terminus. 

In one preferred embodiment, the target nucleic acid is not completely 

15 complementary to at least one of the first, the second, the third and the fourth 

oligonucleotides. In a particularly preferred embodiment, the target nucleic acid is not 
completely complementary to the second ohgonucleotide. 

As noted above, the present invention contemplates the use of structure-specific 
nucleases in detection methods. In one embodiment, the present invention provides a 

20 method of detecting the presence of a target nucleic acid molecule by detecting non- 
target cleavage products comprising: a) providing: i) a cleavage means, ii) a source of 
target nucleic acid, the target nucleic acid having a first region, a second region, a third 
region and a fourth region, wherein the first region is located adjacent to and downstream 
fi*om the second region, the second region is located adjacent to and downstream from the 

25 third region and the third region is located adjacent to and downstream from the fourth 
region; iii) a first oligonucleotide complementary to the fourth region of the target nucleic 
acid; iv) a second ohgonucleotide having a 5' portion and a 3' portion wherein the 5' 
portion of the second oligonucleotide contains a sequence complementary to the second 
region of the target nucleic acid and wherein the 3' portion of the second ohgonucleotide 

30 contains a sequence complementary to the third region of the target nucleic acid; iv) a 
third oligonucleotide having a 5' and a 3' portion wherein the 5' portion of the third 
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oligonucleotide contains a sequence complementary to the first region of the target 
nucleic acid and wherein the 3' portion of the third oligonucleotide contains a sequence 
complementary to the second region of the target nucleic acid; b) mixing the cleavage 
means, the target nucleic acid, the first ohgonucleotide, the second ohgonucleotide and 
5 the third oligonucleotide to create a reaction mixture under reaction conditions such that 
the first ohgonucleotide is annealed to the fourth region of the target nucleic acid and 
wherein at least the 3' portion of the second ohgonucleotide is annealed to the target 
nucleic acid and wherein at least the 5' portion of the third ohgonucleotide is annealed to 
the target nucleic acid so as to create a cleavage structure and wherein cleavage of the 
10 cleavage structure occurs to generate non-target cleavage products, each non-target 

J5 cleavage product having a 3'-hydroxyl group; and c) detecting the non-target cleavage 

^ products. 

,|: The invention is not hmited by the nature of the target nucleic acid. In one 

fl embodiment, the target nucleic acid comprises single-stranded DNA. In another 

5"^ 15 embodiment, the target nucleic acid comprises double-stranded DNA and prior to step c), 

ri the reaction mixture is treated such that the double-stranded DNA is rendered 

jf ] substantially single-stranded. In another embodiment, the target nucleic acid comprises 

i'i RNA and the first and second oligonucleotides comprise DNA. 

lI The invention is not limited by the nature of the cleavage means. In one 

20 embodiment, the cleavage means is a structure-specific nuclease; particularly preferred 
structure-specific nucleases are thermostable structure-specific nucleases. 

In another preferred embodiment the thermostable structure specific nuclease is a 
chimerical nuclease. 

In an alternative preferred embodiment, the detection of the non-target cleavage 
25 products comprises electrophoretic separation of the products of the reaction followed by 
visuahzation of the separated non-target cleavage products. 

In another preferred embodiment, one or more of the first, second, and third 
oligonucleotides contain a dideoxynucleotide at the 3' terminus. When 
dideoxynucleotide-containing oligonucleotides are employed, the detection of the non- 
30 target cleavage products preferably comprises: a) incubating the non-target cleavage 
products with a template-independent polymerase and at least one labeled nucleoside 
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triphosphate under conditions such that at least one labeled nucleotide is added to the 3'- 
hydroxyl group of the non-target cleavage products to generate labeled non-target 
cleavage products; and b) detecting the presence of the labeled non-target cleavage 
products. The invention is not limited by the nature of the template-independent 
5 polymerase employed; in one embodiment, the template-independent polymerase is 
selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase are employed in the detection step, 
the second oligonucleotide may contain a 5' end label, the 5* end label being a different 
label than the label present upon the labeled nucleoside triphosphate. The invention is 

10 not hmited by the nature of the 5' end label; a wide variety of suitable 5' end labels are 
known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
hexachlorofluorescein, Cy3 amidite, Cy5 amidite and digoxigenin. 

In another embodiment, detecting the non-target cleavage products comprises: a) 
incubating the non-target cleavage products with a template-independent polymerase and 

15 at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
added to the 3'-hydroxyl group of the non-target cleavage products to generate tailed non- 
target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
products. The invention is not limited by the nature of the template-independent 
polymerase employed; in one embodiment, the template-independent polymerase is 

20 selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 

poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
the second oligonucleotide may contain a 5' end label. The invention is not limited by the 
nature of the 5' end label; a wide variety of suitable 5' end labels are known to the art and 
include biotin, fluorescein, tetrachlorofluorescein, hexachlorofluorescein, Cy3 amidite, 

25 Cy5 amidite and digoxigenin. 

In a preferred embodiment, the reaction conditions comprise providing a source of 

divalent cations; particularly preferred divalent cations are Mn2+ and Mg2+ ions. 

The present invention further provides a method of detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising: a) 
30 providing: i) a cleavage means, ii) a source of target nucleic acid, the target nucleic acid 
having a first region, a second region and a third region, wherein the first region is 
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located adjacent to and downstream from the second region and wherein the second 
region is located adjacent to and downstream from the third region; iii) a first 
oligonucleotide having a 5' and a 3' portion wherein the 5' portion of the first 
oUgonucleotide contains a sequence complementary to the second region of the target 

5 nucleic acid and wherein the 3' portion of the first oUgonucleotide contains a sequence 
complementary to the third region of the target nucleic acid; iv) a second oUgonucleotide 
having a length between eleven to fifteen nucleotides and further having a 5' and a 3' 
portion wherein the 5' portion of the second oligonucleotide contains a sequence 
complementary to the first region of the target nucleic acid and wherein the 3' portion of 

1 0 the second oUgonucleotide contains a sequence complementary to the second region of 
the target nucleic acid; b) mixing the cleavage means, the target nucleic acid, the first 
oUgonucleotide and the second oligonucleotide to create a reaction mixture under 
reaction conditions such that at least the 3' portion of the first oUgonucleotide is annealed 
to the target nucleic acid and wherein at least the 5' portion of the second oUgonucleotide 

1 5 is annealed to the target nucleic acid so as to create a cleavage structure and wherein 
cleavage of the cleavage stiucture occurs to generate non-target cleavage products, each 
non-target cleavage product having a 3'-hydroxyl group; and c) detecting the non-target 
cleavage products. In a preferred embodiment the cleavage means is a stiiicture-specific 
nuclease, preferably a thermostable structure-specific nuclease. 

20 The invention is not limited by the length of the various regions of the target 

nucleic acid. In a preferred embodiment, the second region of the target nucleic acid has 
a length between one to five nucleotides. In another preferred embodiment, one or more 
of the first and the second oUgonucleotides contain a dideoxynucleotide at the 3' 
terminus. When dideoxynucleotide-containing oUgonucleotides are employed, the 

25 detection of the non-target cleavage products preferably comprises: a) incubating the 
non-target cleavage products with a template-independent polymerase and at least one 
labeled nucleoside triphosphate under conditions such that at least one labeled nucleotide 
is added to the 3'-hydroxyl group of the non-target cleavage products to generate labeled 
non-target cleavage products; and b) detecting the presence of the labeled non-target 

3 0 cleavage products. The invention is not limited by the nature of the template-independent 
polymerase employed; in one embodiment, the template-independent polymerase is 
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selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase is employed in the detection step, 
the second ohgonucleotide may contain a 5* end label, the 5' end label being a different 
label than the label present upon the labeled nucleoside triphosphate. The invention is 

5 not limited by the nature of the 5' end label; a wide variety of suitable 5' end labels are 
known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
hexachlorofluorescein, Cy3 amidite, Cy5 amidite and digoxigenin. 

In another embodiment, detecting the non-target cleavage products comprises: a) 
incubating the non-target cleavage products with a template-independent polymerase and 

10 at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
added to the 3'-hydroxyl group of the non-target cleavage products to generate tailed non- 
target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
products. The invention is not Umited by the nature of the template-independent 
polymerase employed; in one embodiment, the template-independent polymerase is 

1 5 selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
the second oligonucleotide may contain a 5' end label The invention is not limited by the 
nature of the 5' end label; a wide variety of suitable 5' end labels are known to the art and 
include biotin, fluorescein, tetrachlorofluorescein, hexachlorofluorescein, Cy3 amidite, 

20 Cy5 amidite and digoxigenin. 

The novel detection methods of the invention may be employed for the detection 
of target DNAs and RNAs including, but not limited to, target DNAs and RNAs 
comprising wild type and mutant alleles of genes, including genes from humans or other 
animals that are or may be associated with disease or cancer. In addition, the methods of 

25 the invention may be used for the detection of and/or identification of strains of 

microorganisms, including bacteria, ftingi, protozoa, ciliates and viruses (and in particular 
for the detection and identification of RNA viruses, such as HCV). 

The present invention ftirther provides novel enzymes designed for direct 
detection, characterization and quantitation of nucleic acids, particularly RNA. The 

30 present invention provides enzymes that recognize specific nucleic acid cleavage 

structures formed on a target RNA sequence and that cleave the nucleic acid cleavage 
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structure in a site-specific manner to produce non-target cleavage products. The present 
invention provides enzymes having an improved ability to specifically cleave a DNA 
member of a complex comprising DNA and RNA nucleic acid strands. 

For example, the present invention provides DNA polymerases that are altered in 

5 structure relative to the native DNA polymerases, such that they exhibit altered (e.g. , 
improved) performance in detection assays based on the cleavage of a structure 
comprising nucleic acid (e.g., RNA). In particular, the altered polymerases of the present 
invention exhibit improved performance in detection assays based on the cleavage of a 
DNA member of a cleavage structure (e.g., an invasive cleavage structure) that comprises 

1 0 an RNA target strand. 

The improved performance in a detection assay may arise from any one of, or a 
combination of several improved features. For example, in one embodiment, the enzyme 
of the present invention may have an improved rate of cleavage (kcat) on a specific 
targeted structure, such that a larger amount of a cleavage product may be produced in a 

1 5 given time span. In another embodiment, the enzyme of the present invention may have 
a reduced activity or rate in the cleavage of inappropriate or non-specific structures. For 
example, in certain embodiments of the present invention, one aspect of improvement is 
that the differential between the detectable amount of cleavage of a specific structure and 
the detectable amount of cleavage of any alternative structures is increased. As such, it is 

20 within the scope of the present invention to provide an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a fiirther reduced rate of cleavage of any alternative structures, such that the 
differential between the detectable amount of cleavage of the specific structure and the 
detectable amount of cleavage of any alternative structures is increased. However, the 

25 present invention is not limited to enzymes that have an improved differential. 

In a preferred embodiment, the enzyme of the present invention is a DNA 
polymerase having an altered nuclease activity as described above, and also having 
altered synthetic activity, compared to that of any native DNA polymerase firom which 
the enzyme has been derived. It is especially preferred that the DNA polymerase is 

30 altered such that it exhibits reduced synthetic activity as well as improved nuclease 

activity on RNA targets, compared to that of the native DNA polymerase. Enzymes and 
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genes encoding enzymes having reduced synthetic activity have been described {See e.g.. 
Kaiser et al, J, Biol Chem., 274:21387 [1999], Lyamichev et al, Prot. Natl. Acad. Sci., 
96:6143 [1999], US. Patents 5,541,311, 5,614,402, 5,795,763 and 6,090,606, 
incorporated herein by reference in their entireties). The present invention contemplates 

5 combined modifications, such that the resulting 5' nucleases are without interfering 
synthetic activity, and have improved performance in RNA detection assays. 

The present invention contemplates a DNA sequence encoding a DNA 
polymerase altered in sequence relative to the native sequence, such that it exhibits 
altered nuclease activity from that of the native DNA polymerase. For example, in one 

10 embodiment, the DNA sequence encodes an enzyme having an improved rate of cleavage 
(kcat) on a specific targeted structure, such that a larger amount of a cleavage product may 
be produced in a given time span. In another embodiment, the DNA encodes an enzyme 
having a reduced activity or rate in the cleavage of inappropriate or non-specific 
structures. In certain embodiments, one aspect of improvement is that the differential 

1 5 between the detectable amount of cleavage of a specific structure and the detectable 

amount of cleavage of any alternative structures is increased. It is within the scope of the 
present invention to provide a DNA encoding an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a further reduced rate of cleavage of any alternative structures, such that the 

20 differential between the detectable amount of cleavage of the specific structure and the 
detectable amount of cleavage of any altemative structures is increased. However, the 
present invention is not limited to polymerases that have an improved differential. 

In a preferred embodiment, the DNA sequence encodes a DNA polymerase 
having the altered nuclease activity described above, and also having altered synthetic 

25 activity, compared to that of any native DNA polymerase from which the improved 

enzyme is derived. It is especially preferred that the encoded DNA polymerase is altered 
such that it exhibits reduced synthetic activity as well as improved nuclease activity on 
RNA targets, compared to that of the native DNA polymerase. 

It is not intended that the invention be limited by the nature of the alteration 

30 required to introduce altered nuclease activity. Nor is it intended that the invention be 
limited by the extent of either the alteration, or in the improvement observed. If the 
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polymerase is also altered so as to be synthesis modified, it is not intended that the 
invention be limited by the polymerase activity of the modified or unmodified protein, or 
by the nature of the alteration to render the polymerase synthesis modified. 

The present invention contemplates structure-specific nucleases fi-om a variety of 
5 sources, including, but not limited to, mesophilic, psychrophilic, thermophihc, and 
hyperthermophilic organisms. The preferred structure-specific nucleases are 
thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
usefiil in that they allow the INVADER assay {See e.g., U.S. Pat. Nos. 5,846,717, 
5,985,557, 5,994,069, 6,001,567, and 6,090,543 and PCT Publications WO 97/27214 and 
1 0 WO 98/42873, incorporated herein by reference in their entireties) to be operated near the 
melting temperature (Tm) of the downstream probe oligonucleotide, so that cleaved and 
uncleaved probes may cycle on and off the target during the course of the reaction. In 
one embodiment, tiie thermostable stinicture-specific enzymes are thermostable 5' 
nucleases that are selected fi:om the group comprising altered polymerases derived firom 
1 5 the native polymerases of Thermus species, including, but not limited to, Thermus 
aquaticus, Thermus flavus, Thermus thermophilus, Thermus filiformus, and Thermus 
scotoductus. However, the invention is not limited to the use of thermostable 5' 
nucleases. For example, certain embodiments of the present invention utilize short 
oligonucleotide probes that may cycle on and off of the target at low temperatures, 
20 allowing the use of non-thermostable enzymes. 

In some preferred embodiments, tiie present invention provides a composition 
comprising an enzyme, wherein the enzyme comprises a heterologous functional domain, 
wherem the heterologous functional domain provides altered {e.g., improved) 
functionality in a nucleic acid cleavage assay. The present mvention is not limited by the 
25 nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 

include any assay m which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
invasive cleavage assay. In particularly preferred embodiments, the cleavage assay 
utiUzes a cleavage stiixcture having at least one RNA component. In another particularly 
30 preferred embodiment, the cleavage assay utilizes a cleavage structure having at least one 
RNA component, wherein a DNA member of the cleavage structure is cleaved. 
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In some preferred embodiments, the enzyme comprises a 5' nuclease or a 
polymerase. In certain preferred embodiments, the 5' nuclease comprises a thermostable 
5' nuclease. In other preferred embodiments, the polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced 
DNA synthetic activity from that of the naturally occurring polymerase. In certain 
preferred embodiments, the polymerase comprises a thermostable polymerase (e.g., a 
polymerase from a Thermus species including, but not limited to, Thermus aquaticus, 
Thermusflavus, Thermus thermophilus, Thermus filiformus, and Thermus scotoductus). 

The present invention is not limited by the nature of the altered functionality 
provided by the heterologous functional domain. Illustrative examples of alterations 
include, but are not limited to, enzymes where the heterologous functional domain 
comprises an amino acid sequence {e.g., one or more amino acids) that provides an 
improved nuclease activity, an improved substrate binding activity and/or improved 
background specificity in a nucleic acid cleavage assay. 

The present invention is not limited by the nature of the heterologous functional 
domain. For example, in some embodiments, the heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase {e.g., 
introduced into the enzyme by insertion of a chimeric functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 
is from a palm or thumb region of the polymerase domain. The present invention is not 
limited by the identity of the polymerase from which the two or more amino acids are 
selected. In certain preferred embodiments, the polymerase comprises Thermus 
thermophilus polymerase, hi particularly preferred embodiments, the two or more amino 
acids are from amino acids 300-650 of SEQ ID N0:1. 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild 
type and mutant alleles of genes, including, but not lunited to, genes from humans, other 
animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 
identification of strains of microorganisms, including bacteria, fungi, protozoa, cihates 
and viruses (and in particular for the detection and identification of viruses having RNA 
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genomes, such as the Hepatitis C and Human Immunodeficiency vkuses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 
an enzyme of the present invention and a substrate nucleic acid; and exposing the 
substrate nucleic acid to the enzyme {e.g., to produce a cleavage product that may be 
5 detected). 

In one embodiment, tbe present invention provides a thermostable 5' nuclease 
having an amino acid sequence selected fi-om the group comprising SEQ ID N0S:2, 3, 4, 
5, 6, 7, 8, 9, 10, 11 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 

10 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 341, 346, 348, 351, 353, 359, 365, 
367, 369, 374, 376, 380, 384, 388, 392, 396, 400, 402, 406, 408, 410, 412, 416, 418, 420, 
424, 427, 429, 432, 436, 440, 444, 446, 448, 450, 456, 460, 464, 468, 472, 476, 482, 485, 
488, 491, 494, 496, 498, 500, 502, 506, 510, 514, 518, 522, 526, 530, 534, 538, 542, 544, 
550, 553, 560, 564, 566, 568, 572, 574, 576, 578, 580, 582, 584, 586, 588, and 590. In 

1 5 another embodiment, the 5' nuclease is encoded by a DNA sequence selected from the 
group comprising of SEQ ID NOS:69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 
105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 
123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 

20 358, 364, 366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 
417, 419, 423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 
471, 475, 481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 
543, 549, 552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, and 589. 
The present invention also provides a recombinant DNA vector comprising DNA 

25 having a nucleotide sequence encoding a 5' nuclease, the nucleotide sequence selected 
from the group comprising SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 
81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 
103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 
121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 

30 350, 352, 358, 364, 366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 
411, 415, 417, 419, 423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 
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463, 467, 471, 475, 481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 
537, 541, 543, 549, 552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, 
and 589. In a preferred embodiment, the invention provides a host cell transformed with 
a recombinant DNA vector comprising DNA having a nucleotide sequence encoding a 

5 structure-specific nuclease, the nucleotide selected from the group comprising sequence 
SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 
109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 
127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 358, 364, 366, 368, 

10 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 423, 426, 
428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 481, 484, 
495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 552, 559, 
563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, and 589. The invention is not 
limited by the nature of the host cell employed. The art is well aware of expression 

15 vectors suitable for the expression of nucleotide sequences encoding structure-specific 
nucleases that can be expressed in a variety of prokaryotic and eukaryotic host cells. In a 
preferred embodiment, the host cell is an Escherichia colt cell. 

The present invention provides a method of altering 5' nuclease enzymes relative 
to native 5' nuclease enzymes, such that they exhibit improved performance in detection 

20 assays based on the cleavage of a structure comprising RNA. In particular, the altered 5' 
nucleases produced by the method of the present invention exhibit improved performance 
in detection assays based on the cleavage of a DNA member of a cleavage structure {e.g., 
an mvasive cleavage structure) that comprises an RNA target strand. The improved 5' 
nucleases resulting from the methods of the present invention may be improved in any of 

25 the ways discussed herein. Examples of processes for assessing improvement in any 
candidate enzyme are provided. 

For example, the present invention provides methods for producing an altered 
enzyme with improved ftmctionality in a nucleic acid cleavage assay comprising: 
providing an enzyme and a nucleic acid test substrate; introducing a heterologous 

30 functional domain into the enzyme to produce an ahered enzyme; contacting the altered 
enzyme with the nucleic acid test subsfrate to produce cleavage products; and detecting 
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the cleavage products. In some embodiments, the introduction of the heterologous 
functional domain comprises mutating one or more amino acids of the enzyme. In other 
embodiments, the introduction of the heterologous fimctional domain into the enzyme 
comprises adding a functional domain from a protein (e.g., another enzyme) into the 
5 enzyme (e.g. , substituting functional domains by removing a portion of the enzyme 
sequence prior to adding the functional domain of the protein). In preferred 
embodiments, the nucleic acid test substrate comprises a cleavage structure. In 
particularly preferred embodiment, the cleavage structure comprises an RNA target 
nucleic acid. In yet other preferred embodiments, the cleavage structure comprises an 
10 invasive cleavage structure. 

The present invention also provides nucleic acid treatment kits. One preferred 
embodiment is a kit comprising a composition comprising at least one improved 5' 
nuclease. Another preferred embodiment provides a kit comprising: a) a composition 
comprising at least one improved 5' nuclease; and b) an INVADER oligonucleotide and a 
1 5 signal probe oligonucleotide. In some embodiments of the kits of the present invention, 
the improved 5' nuclease is derived from a DNA polymerase from a eubacterial species. 
In fiirther embodiments, the eubacterial species is a thermophile. In still further 
embodiments, the thermophile is of the genus Thermus. In still further embodiments, the 
thermophile is selected from the group consisting of Thermus aquaticus, Thermus flavus, 
20 Thermus thermophilus, Thermus filiformus, and Thermus scotoductus. In preferred 
embodiments, the improved 5' nuclease is encoded by DNA selected from the group 
comprising SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 
107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 
25 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 358, 364, 
366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 
423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 
481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 
552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, and 589. In yet 
30 other preferred embodiments, the kits further comprise reagents for detecting a nucleic 
acid cleavage product. In further preferred embodiments, the reagents for detecting a 
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cleavage product comprise oligonucleotides for use in a subsequent invasive cleavage 
reaction {See e.g., U.S. Patent No. 5,994,069). In particularly preferred embodiments, the 
reagents for the subsequent invasive cleavage reaction comprise a probe labeled with 
moieties that produce a fluorescence resonance energy transfer (FRET) effect. 

5 The present invention also provides methods for treating nucleic acid, comprising: 

a) providing: a first structure-specific nuclease consisting of an endonuclease in a 
solution containing manganese; and a nucleic acid substrate; b) treating the nucleic acid 
substrate with increased temperature such that the substrate is substantially single- 
stranded; c) reducing the temperature under conditions such that the single-stranded 

10 substrate forms one or more cleavage structures; d) reacting the cleavage means with the 
cleavage structures so that one or more cleavage products are produced; and e) detecting 
the one or more cleavage products. In some embodiments of the methods, the 
endonuclease includes, but is not Umited to, CLEAVASE BN enzyme, Thermus 
aquaticus DNA polymerase, Thermus thermophilus DNA polymerase, Escherichia coli 

1 5 Exo III, and the Saccharomyces cerevisiae Radl/Radl 0 complex. In yet other preferred 
embodiments, the nuclease is a 5' nuclease derived fi-om a thermostable DNA polymerase 
altered in amino acid sequence such that it exhibits reduced DNA synthetic activity firom 
that of the wild-type DNA polymerase but retains substantially the same 5' nuclease 
activity of the wild-type DNA polymerase. In yet other embodiments, the nucleic acid is 

20 selected from the group consisting of RNA and DNA. In further embodiments, the 
nucleic acid of step (a) is double stranded. 

The present invention also provides nucleic acid treatment kits, comprising: a) a 
composition comprising at least one purified FEN-1 endonuclease; and b) a solution 
containing manganese. In some embodiments of the kits, the purified FEN-1 

25 endonuclease is selected from the group consisting Pyrococcus woesei FEN- 1 

endonuclease, Pyrococcus furiosus FEN-1 endonuclease, Methanococcus jannaschii 
FEN-1 endonuclease, Methanobacterium thermoautotrophicum FEN-1 endonuclease, 
Archaeoglobus fulgidus FEN-1, Sulfolobus solfataricus, Pyrobaculum aerophilum, 
Thermococcus litoralis, Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus 

30 brierlyi, Acidianus ambivalens, Desulfurococcus amylolyticus, Desulfurococcus mobilis, 
Pyrodictium brockii, Thermococcus gorgonarius, Thermococcus zilligii, Methanopyrus 
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kandlerh Methanococcus igneus, Pyrococcus horikoshii, Aeropyrum pernix, and 
chimerical FEN-1 endonucleases. In other embodiments, the kits further comprise at 
least one second structure-specific nuclease. In some preferred embodiments, the second 
nuclease is a 5' nuclease derived from a thermostable DNA polymerase altered in amino 
5 acid sequence such that it exhibits reduced DNA synthetic activity from that of the wild- 
type DNA polymerase but retains substantially the same 5' nuclease activity of the wild- 
type DNA polymerase. In yet other embodiments of the kits, the portion of the amino 
acid sequence of the second nuclease is homologous to a portion of the amino acid 
sequence of a thermostable DNA polymerase derived from a eubacterial thermophile of 
10 the genus Thermus. In further embodiments, the thermophile is selected from the group 
consisting of Thermus aquaticus, Thermus flavus and Thermus thermophilus. In yet other 
preferred embodiments, the kits ftirther comprise reagents for detecting the cleavage 
products. 

The present invention further provides any of the compositions, mixtures, 

1 5 methods, and kits described herein, used in conjunction with endonucleases comprising 
Sulfolobus solfataricus, Pyrobaculum aerophilum, Thermococcus litoralis, 
Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus brierlyi Acidianus 
ambivalens, Desulfurococcus amylqlyticus, Desulfurococcus mobilis, Pyrodictium 
brockii Thermococcus gorgonarius, Thermococcus zilligii Methanopyrus kandleri, 

20 Methanococcus igneus, Pyrococcus horikoshii, and Aeropyrum pernix endonucleases. 
These include compositions comprising purified FEN-1 endonucleases from the 
organisms (including specific endonucleases described by sequences provided herein, as 
well as, variants and homologues), kits comprising these compositions, composition 
comprising chimerical endonucleases comprising at least a portion of the endonucleases 

25 from these organisms, kits comprising such compositions, compositions comprising 
nucleic acids encoding the endonucleases from these organisms (including vectors and 
host cells), kits comprising such compositions, antibodies generated to the endonucleases, 
mixtures comprising endonucleases from these organisms, methods of using the 
endonuclease in cleavage assays (e.g., invasive cleavage assays, CFLP, etc.), and kits 

30 containing components useful for such methods. Examples describing the generation, 
structure, use, and characterization of these endonucleases are provided herein. 
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The present invention also provides methods for improving the methods and 
enzymes disclosed herein. For example, the present invention provides methods of 
improving enzymes for any intended purpose (e.g., use in cleavage reactions, 
amphfication reactions, binding reactions, or any other use) comprising the step of 
5 providing an enzyme disclosed herein and modifying the enzyme (e.g., altering the amino 
acid sequence, adding or subtracting sequence, adding post-translational modifications, 
adding any other component whether biological or not, or any other modification). 
Likewise, the present invention provides methods for improving the methods disclosed 
herein comprising, conducting the method steps with one or more changes (e.g., change 
10 in a composition provided in the method, change in the order of the steps, or addition or 
^: subtraction of steps). 

m The improved performance in a detection assay may arise from any one of, or a 

J? combination of several improved features. For example, in one embodiment, the enzyme 

3"; of the present invention may have an improved rate of cleavage (kcat) on a specific 

i i 1 5 targeted structure, such that a larger amount of a cleavage product may be produced in a 
J., given time span. In another embodiment, the enzyme of the present invention may have 

W\ a reduced activity or rate in the cleavage of inappropriate or non-specific structures. For 

% example, in certain embodiments of the present invention, one aspect of improvement is 

F' that the differential between the detectable amount of cleavage of a specific structure and 

20 the detectable amount of cleavage of any alternative structures is increased. As such, it is 
within the scope of the present invention to provide an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a further reduced rate of cleavage of any altemative structures, such that the 
differential between the detectable amount of cleavage of the specific structure and the 
25 detectable amount of cleavage of any altemative structures is increased. However, the 
present invention is not limited to enzymes that have an improved differential. 

In some preferred embodiments, the present invention provides a composition 
comprising an enzyme, wherein the enzyme comprises a heterologous functional domain, 
wherein the heterologous functional domain provides ahered (e.g., improved) 
30 fimctionahty in a nucleic acid cleavage assay. The present invention is not limited by the 
nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 
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include any assay in which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
invasive cleavage assay. In particularly preferred embodiments, the cleavage assay 
utilizes a cleavage structure having at least one RNA component. In another particularly 
5 preferred embodiment, the cleavage assay utihzes a cleavage structure having at least one 
RNA component, wherein a DNA member of the cleavage structure is cleaved. 

The present invention is not limited by the nature of the altered functionality 
provided by the heterologous functional domain. Illustrative examples of alterations 
include, but are not Umited to, enzymes where the heterologous functional domain 
10 comprises an amino acid sequence (e.g., one or more amino acids) that provides an 
improved nuclease activity, an improved substrate binding activity and/or improved 
background specificity in a nucleic acid cleavage assay. 

The present invention is not limited by the nature of the heterologous functional 
domain. For example, in some embodiments, the heterologous functional domain 
1 5 comprises two or more amino acids from a polymerase domain of a polymerase (e.g., 
introduced into the enzyme by insertion of a chimerical functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 
is from a palm or thumb region of the polymerase domain. The present invention is not 
Umited by the identity of the polymerase from which the two or more amino acids are 
20 selected. In certain preferred embodiments, the polymerase comprises Thermus 

thermophilus polymerase. In particularly preferred embodiments, the two or more amino 
acids are fi:om amino acids 300-650 of SEQ ID N0:1. 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild 
25 type and mutant alleles of genes, including, but not limited to, genes from humans, other 
animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 
identification of strains of microorganisms, including bacteria, fungi, protozoa, ciliates 
and viruses (and in particular for the detection and identification of viruses having RNA 
30 genomes, such as the Hepatitis C and Human Immunodeficiency viruses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 



an enzyme of the present invention and a substrate nucleic acid; and exposing the 
substrate nucleic acid to the enzyme (e.g., to produce a cleavage product that may be 
detected). In some embodiments, the substrate nucleic is in a cell lysate sample. 

The present invention also provides a method for detecting the presence of a 

5 target nucleic acid comprising: cleaving an invasive cleavage structure, said invasive 
cleavage structure comprising an RNA target nucleic acid; and detecting the cleavage of 
the invasive cleavage structure. Such an assay may comprise a multiplex assay, wherein 
multiple invasive cleavage structures are cleaved. Such structures include structures 
formed on different target nucleic acids, as well as, structures formed on different 

1 0 locations of the sample target nucleic acid. In some embodiments, the target nucleic acid 
comprises a first region and a second region, said second region downstream of and 
contiguous to said first region. In some embodiments, the invasive cleavage structure 
comprises the target nucleic acid, a first oHgonucleotide, and a second oligonucleotide, 
wherein at least a portion of the first oligonucleotide is completely complementary to the 

1 5 fu-st portion of the first target nucleic acid, and wherein the second oHgonucleotide 
comprises a 3' portion and a 5' portion, wherein the 5' portion is completely 
complementary to said second portion of the target nucleic acid, hi some embodiments, 
the 3' portion of the second oHgonucleotide comprises a 3' terminal nucleotide not 
complementary to said target nucleic acid. In some embodiments, the 3' portion of the 

20 second oHgonucleotide consists of a single nucleotide not complementary to the target 
nucleic acid. In some embodiments, the method further comprises the steps of forming a 
second invasive cleavage structure comprising a non-target cleavage product and 
cleaving the second invasive cleavage structure. In some embodiments, the invasive 
cleavage structure or the second invasive cleavage comprises an oHgonucleotide 

25 comprising a sequence selected fi-om the group consisting of SEQ ID NO:709-2640. hi 
other embodiments, the invasive cleavage structure or the second invasive cleavage 
comprises an oHgonucleotide comprising a sequence selected from the group consisting 
of SEQ ID NO:169-21 1 and 619-706. In some preferred embodiments, the target nucleic 
acid comprises a cytochrome P450 RNA or a cytokine RNA. hi some embodiments, the 

30 first region or the second region of the target nucleic acid encompasses a spHce junction, 
an exon (or a portion thereof), or an intron (or a portion thereof). In some embodiments. 
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the RNA target nucleic acid is provided in a cell lysate. In some embodiments, the first 
oUgonucleotide is covalently attached to the second oligonucleotide. Such 
oUgonucleotides find use, for example, in methods described in U.S. Patent Nos. 
5,714,320 and 5,854,033, herein incorporated by reference in their entireties. The present 
5 invention also provides kits containing one or more of the components used in the above 
methods. 

DEFINITIONS 

To faciUtate an understanding of the present invention, a number of terms and 

10 phrases are defined below: 

As used herein, the terms "complementary" or "complementarity" are used in 
reference to polynucleotides (i.e., a sequence of nucleotides such as an oligonucleotide or 
a target nucleic acid) related by the base-pairing rules. For example, for the sequence " 
5'-A-G-T-3'," is complementary to the sequence " 3'-T-C-A-5'." Complementarity may be 

1 5 "partial," in which only some of the nucleic acids' bases are matched according to the 
base pairing rules. Or, there may be "complete" or "total" complementarity between the 
nucleic acids. The degree of complementarity between nucleic acid strands has 
significant effects on the efficiency and strength of hybridization between nucleic acid 
strands. This is of particular importance in amplification reactions, as well as detection 

20 methods that depend upon binding between nucleic acids. Either term may also be used 
in reference to individual nucleotides, especially within the context of polynucleotides. 
For example, a particular nucleotide within an ohgonucleotide may be noted for its 
complementarity, or lack thereof, to a nucleotide within another nucleic acid strand, in 
contrast or comparison to the complementarity between the rest of the ohgonucleotide 

25 and the nucleic acid strand. Nucleotide analogs used to form non-standard base pairs, 
whether with another nucleotide analog (e.g., an IsoC/IsoG base pair), or with a naturally 
occurring nucleotide (e.g., as described in U.S. Patent 5,912,340, herein incorporated by 
reference in its entirety) are also considered to be complementary to a base pairing 
partner within the meaning this definition. 
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The term "homology" and "homologous" refers to a degree of identity. There 
may be partial homology or complete homology. A partially homologous sequence is 
one that is less than 100% identical to another sequence. 

As used herein, the term "hybridization" is used in reference to the pairing of 
5 complementary nucleic acids. Hybridization and the strength of hybridization {i.e., the 
strength of the association between the nucleic acids) is influenced by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions 
involved, and the T^ of the formed hybrid. "Hybridization" methods involve the 
annealing of one nucleic acid to another, complementary nucleic acid, Le., a nucleic acid 
10 having a complementary nucleotide sequence. The ability of two polymers of nucleic 
% acid containing complementary sequences to find each other and anneal through base 

flj pairing interaction is a well-recognized phenomenon. The initial observations of the 

> "hybridization" process by Marmur and Lane, Proc. Natl. Acad. Sci. USA 46:453 (1960) 

0 ' and Doty et al., Proc, Natl. Acad, Sci. USA 46:461 (1960) have been followed by the 

§1 15 refinement of this process into an essential tool of modem biology. 

With regard to complementarity, it is important for some diagnostic appUcations 
^ to determine whether the hybridization represents complete or partial complementarity, 

j!: For example, where it is desired to detect simply the presence or absence of pathogen 

rf DNA (such as from a virus, bacterium, fungi, mycoplasma, protozoan) it is only 

20 important that the hybridization method ensures hybridization when the relevant 
sequence is present; conditions can be selected where both partially complementary 
probes and completely complementary probes will hybridize. Other diagnostic 
applications, however, may require that the hybridization method distinguish between 
partial and complete complementarity. It may be of interest to detect genetic 
25 polymorphisms. For example, human hemoglobin is composed, in part, of four 

polypeptide chains. Two of these chains are identical chains of 141 amino acids (alpha 
chains) and two of these chains are identical chains of 146 amino acids (beta chains). 
The gene encoding the beta chain is known to exhibit polymorphism. The normal allele 
encodes a beta chain having glutamic acid at the sixth position. The mutant allele 
30 encodes a beta chain having valine at the sixth position. This difference in amino acids 
has a profound (most profound when the individual is homozygous for the mutant allele) 
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physiological impact known clinically as sickle cell anemia. It is well known that the 
genetic basis of the amino acid change involves a single base difference between the 
normal allele DNA sequence and the mutant allele DNA sequence. 

The complement of a nucleic acid sequence as used herein refers to an 

5 oligonucleotide which, when aUgned with the nucleic acid sequence such that the 5' end 
of one sequence is paired with the 3' end of the other, is in "antiparallel association." 
Certain bases not commonly found in natural nucleic acids may be included in the nucleic 
acids of the present invention and include, for example, inosine and 7-deazaguanine. 
Complementarity need not be perfect; stable duplexes may contain mismatched base 

1 0 pairs or unmatched bases. Those skilled in the art of nucleic acid technology can 

determine duplex stability empirically considering a number of variables including, for 
example, the length of the oligonucleotide, base composition and sequence of the 
oligonucleotide, ionic strength and incidence of mismatched base pairs. 

As used herein, the term " Tm " is used in reference to the "melting temperature." 

1 5 The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. Several equations 
for calculating the Tm of nucleic acids are well known in the art. As indicated by 
standard references, a simple estimate of the Tm value may be calculated by the equation: 
Tm = 81.5 + 0.41(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (see 

20 e.g. , Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid 
Hybridization (1985). Other references {e.g., AUawi, H.T. & SantaLucia, J., Jr. 
Thermodynamics and NMR of internal G.T mismatches in DNA. Biochemistry 36, 
10581-94 (1997) include more sophisticated computations which take structural and 
environmental, as well as sequence characteristics into account for the calculation of Tm- 

25 As used herein the term "stringency" is used in reference to the conditions of 

temperature, ionic strength, and the presence of other compounds, under which nucleic 
acid hybridizations are conducted. With "high stringency" conditions, nucleic acid base 
pahing will occur only between nucleic acid fragments that have a high frequency of 
complementary base sequences. Thus, conditions of "weak" or "low" stringency are 

30 often required when it is desired that nucleic acids that are not completely 
complementary to one another be hybridized or annealed together. 
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"High stringency conditions" when used in reference to nucleic acid hybridization 
comprise conditions equivalent to binding or hybridization at 42 C in a solution 
consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH2P04 H2O and 1.85 g/1 EDTA, pH 
adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ^g/ml denatured 

5 salmon sperm DNA followed by washing in a solution comprising 0. IX SSPE, 1.0% SDS 
at 42 C when a probe of about 500 nucleotides in length is employed. 

"Medium stringency conditions" when used in reference to nucleic acid 
hybridization comprise conditions equivalent to binding or hybridization at 42 C in a 
solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH2P04 H2O and 1.85 g/1 

10 EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ^g/ml 
denatured salmon sperm DNA followed by washing in a solution comprising l.OX SSPE, 
1.0% SDS at 42 C when a probe of about 500 nucleotides in length is employed. 

"Low stringency conditions" comprise conditions equivalent to binding or 
hybridization at 42 C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 

1 5 NaH2P04 H2O and 1 .85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0. 1 % SDS, 5X 
Denhardt's reagent [SOX Denhardt's contains per 500 ml: 5 g FicoU (Type 400, 
Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 g/ml denatured salmon sperm DNA 
followed by washing in a solution comprising 5X SSPE, 0.1% SDS at 42 C when a probe 
of about 500 nucleotides in length is employed. 

20 The term "gene" refers to a DNA sequence that comprises control and codmg 

sequences necessary for the production of an RNA having a non-coding function (e.g., a 
ribosomal or transfer RNA), a polypeptide or a precursor. The RNA or polypeptide can 
be encoded by a full-length coding sequence or by any portion of the coding sequence so 
long as the desired activity or function is retained. 

25 The term "wild-type" refers to a gene or a gene product that has the characteristics 

of that gene or gene product when isolated from a naturally occurring source. A wild- 
type gene is that which is most frequently observed in a population and is thus arbitrarily 
designated the "normal" or "wild-type" form of the gene. In contrast, the term 
"modified," "mutant," or "polymorphic" refers to a gene or gene product that displays 

30 modifications in sequence and or functional properties (i.e. , altered characteristics) when 
compared to the wild-type gene or gene product. It is noted that naturally-occurring 
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mutants can be isolated; these are identified by the fact that they have ahered 
characteristics when compared to the wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired heterologous sequence. For example, although the term is not 
5 limited to the use of expressed sequences or sequences that encode an expression product, 
in some embodiments, the heterologous sequence is a coding sequence and appropriate 
DNA sequences necessary for either the replication of the coding sequence in a host 
organism, or the expression of the operably linked coding sequence in a particular host 
organism. DNA sequences necessary for expression in prokaryotes include a promoter, 
1 0 optionally an operator sequence, a ribosome-binding site and possibly other sequences. 
5 Eukaryotic cells are known to utihze promoters, polyadenlyation signals and enhancers, 

m The term "LTR" as used herein refers to the long terminal repeat found at each 

% end of a provirus (i.e. , the integrated form of a retrovirus). The LTR contams numerous 

ff^ regulatory signals including transcriptional control elements, polyadenylation signals and 

1 5 sequences needed for repUcation and integration of the viral genome. The viral LTR is 
L. divided into three regions called U3, R and U5 . 

y1 The U3 region contains the enhancer and promoter elements. The U5 region 

p contains the polyadenylation signals. The R (repeat) region separates the U3 and U5 

F;^ regions and transcribed sequences of the R region appear at both the 5' and 3' ends of the 

20 viral RNA. 

The term "oHgonucleotide" as used herein is defmed as a molecule comprising 
two or more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, 
more preferably at least about 10-15 nucleotides and more preferably at least about 15 to 
30 nucleotides. The exact size will depend on many factors, which in turn depend on the 
25 ultmiate fimction or use of the oligonucleotide. The oHgonucleotide may be generated in 
any manner, including chemical synthesis, DNA rephcation, reverse transcription, PCR, 
or a combination thereof 

Because mononucleotides are reacted to make ohgonucleotides in a manner such 
that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of 
30 its neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5' end" if its 5' phosphate is not linked to the 3' oxygen of a 
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mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oUgonucleotide, also may be said to have 5' and 3' 
ends. A first region along a nucleic acid strand is said to be upstream of another region if 
5 the 3' end of the first region is before the 5' end of the second region when moving along 
a strand of nucleic acid in a 5' to 3' direction. 

When two different, non-overlapping oHgonucleotides anneal to different regions 
of the same linear complementary nucleic acid sequence, and the 3' end of one 
ohgonucleotide points towards the 5' end of the other, the former may be called the 
10 "upstream" oligonucleotide and the latter the "downstream" oligonucleotide. Similarly, 
when two overlapping oligonucleotides are hybridized to the same linear complementary 
nucleic acid sequence, with the first oUgonucleotide positioned such that its 5' end is 
upstream of the 5' end of the second oUgonucleotide, and the 3' end of the first 
oligonucleotide is upstream of the 3' end of the second oUgonucleotide, the first 
1 5 oligonucleotide may be called the "upstream" oUgonucleotide and the second 
oligonucleotide may be called the "downstream" oUgonucleotide. 

The term "primer" refers to an oligonucleotide that is capable of acting as a point 
of initiation of synthesis when placed under conditions in which primer extension is 
initiated. An oUgonucleotide "primer" may occur naturally, as in a purified restriction 
20 digest or may be produced synthetically. 

A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur. A primer sequence need not reflect 
the exact sequence of the template. For example, a non-complementary nucleotide 
25 fragment may be attached to the 5' end of the primer, with the remainder of the primer 
sequence being substantially complementary to the strand. Non-complementary bases or 
longer sequences can be interspersed into the primer, provided that the primer sequence 
has sufficient complementarity with the sequence of the template to hybridize and 
thereby form a template primer complex for synthesis of the extension product of the 
30 primer. 
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The term "label" as used herein refers to any atom or molecule that can be used to 
provide a detectable (preferably quantifiable) effect, and that can be attached to a nucleic 

32 

acid or protein. Labels include but are not limited to dyes; radiolabels such as P; 
binding moieties such as biotin; haptens such as digoxgenin; luminogenic, 

5 phosphorescent or fluorogenic moieties; and fluorescent dyes alone or in combination 
with moieties that can suppress or shift emission spectra by fluorescence resonance 
energy transfer (FRET). Labels may provide signals detectable by fluorescence, 
radioactivity, colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, 
enzymatic activity, and the like, A label may be a charged moiety (positive or negative 

10 charge) or alternatively, may be charge neutral. Labels can include or consist of nucleic 
acid or protein sequence, so long as the sequence comprising the label is detectable. 

The term "signal" as used herein refers to any detectable effect, such as would be 
caused or provided by a label or an assay reaction. 

As used herein, the term "detector" refers to a system or component of a system, 

15 e.g., an instrument (e.g. a camera, fluorimeter, charge-coupled device, scintillation 

counter, etc.) or a reactive medium (X-ray or camera film, pH indicator, etc.), that can 
convey to a user or to another component of a system (e.g., a computer or controller) the 
presence of a signal or effect, A detector can be a photometric or spectrophotometric 
system, which can detect ultraviolet, visible or infrared Ught, including fluorescence or 

20 chemiluminescence; a radiation detection system; a spectroscopic system such as nuclear 
magnetic resonance spectroscopy, mass spectrometry or surface enhanced Raman 
spectrometry; a system such as gel or capillary electrophoresis or gel exclusion 
chromatography; or other detection systems known in the art, or combinations thereof. 
The term "cleavage structure" as used herein, refers to a structure that is formed 

25 by the interaction of at least one probe ohgonucleotide and a target nucleic acid, forming 
a structure comprising a duplex, the resulting structure being cleavable by a cleavage 
agent, including but not hmited to an enzyme. The cleavage structure is a substrate for 
specific cleavage by the cleavage means in contrast to a nucleic acid molecule that is a 
substrate for non-specific cleavage by agents such as phosphodiesterases that cleave 

30 nucleic acid molecules without regard to secondary structure (i.e., no formation of a 
duplexed structure is required). 



The term "folded cleavage structure" as used herein, refers to a region of a single- 
stranded nucleic acid substrate containing secondary structure, the region being cleavable 
by an enzymatic cleavage means. The cleavage structure is a substrate for specific 
cleavage by the cleavage means in contrast to a nucleic acid molecule that is a substrate 
5 for non-specific cleavage by agents such as phosphodiesterases that cleave nucleic acid 
molecules without regard to secondary structure (i.e., no folding of the substrate is 
required). 

As used herein, the term "folded target" refers to a nucleic acid strand that 
contains at least one region of secondary structure (i.e., at least one double stranded 
10 region and at least one single-stranded region within a single strand of the nucleic acid), 
y A folded target may comprise regions of tertiary structure in addition to regions of 

K secondary structure. 

£ The term "cleavage means" or "cleavage agent" as used herein refers to any 

f I means that is capable of cleaving a cleavage structure, inchiding but not hmited to 

ft 1 5 enzymes. The cleavage means may include native DNAPs having 5' nuclease activity 
h (e.g. , Taq DNA polymerase, E. coli DNA polymerase I) and, more specifically, modified 

W DNAPs having 5' nuclease but lackmg synthetic activity. "Structure-specific nucleases" 

£ or "structure-specific enzymes" are enzymes that recognize specific secondary structures 

rf in a nucleic acid molecule and cleave these structures. The cleavage means of the 

20 invention cleave a nucleic acid molecule in response to the formation of cleavage 

structures; it is not necessary that the cleavage means cleave the cleavage stiiicture at any 
particular location within the cleavage structure. 

The cleavage means is not restricted to enzymes having solely 5' nuclease 
activity. The cleavage means may include nuclease activity provided from a variety of 
25 sources including the CLE A VASE enzymes, the FEN- 1 endonucleases (including RAD2 
and XPG protems), Taq DNA polymerase and E. coli DNA polymerase I. 

The term "thermostable" when used in reference to an enzyme, such as a 5' 
nuclease, indicates that the enzyme is functional or active (i.e., can perform catalysis) at 
an elevated temperature, i.e., at about 55°C or higher. 
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The term "cleavage products" as used herein, refers to products generated by the 
reaction of a cleavage means with a cleavage structure (i.e., the treatment of a cleavage 
structure with a cleavage means). 

The term "target nucleic acid" refers to a nucleic acid molecule containing a 
5 sequence that has at least partial complementarity with at least a probe oligonucleotide 
and may also have at least partial complementarity with an INVADER oligonucleotide. 
The target nucleic acid may comprise single- or double-stranded DNA or RNA, and may 
comprise nucleotide analogs, labels, and other modifications. 

The term "probe ohgonucleotide" refers to an oligonucleotide that interacts with a 
1 0 target nucleic acid to form a cleavage structure in the presence or absence of an 
INVADER oligonucleotide. When annealed to the target nucleic acid, the probe 
oligonucleotide and target form a cleavage structure and cleavage occurs within the probe 
oligonucleotide. 

The term "non-target cleavage product" refers to a product of a cleavage reaction 

1 5 that is not derived from the target nucleic acid. As discussed above, in the methods of the 
present invention, cleavage of the cleavage structure generally occurs within the probe 
ohgonucleotide. The fragments of the probe oligonucleotide generated by this target 
nucleic acid-dependent cleavage are "non-target cleavage products." 

The term "INVADER oligonucleotide" refers to an ohgonucleotide that 

20 hybridizes to a target nucleic acid at a location near the region of hybridization between a 
probe and the target nucleic acid, wherein the INVADER oligonucleotide comprises a 
portion {e.g. , a chemical moiety, or nucleotide— whether complementary to that target or 
not) that overiaps with the region of hybridization between the probe and target. In some 
embodiments, the INVADER oligonucleotide contains sequences at its 3' end that are 

25 substantially the same as sequences located at the 5' end of a probe oligonucleotide. 

The term "substantially single-stranded" when used in reference to a nucleic acid 
substrate means that the substrate molecule exists primarily as a single sfrand of nucleic 
acid in contrast to a double-stranded substrate which exists as two strands of nucleic acid 
which are held together by inter-strand base pairing interactions. 

30 The term "sequence variation" as used herein refers to differences in nucleic acid 

sequence between two nucleic acids. For example, a wild-type structural gene and a 
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mutant form of this wild-type structural gene may vary in sequence by the presence of 
single base substitutions and/or deletions or insertions of one or more nucleotides. These 
two forms of the structural gene are said to vary in sequence from one another. A second 
mutant form of the structural gene may exist. This second mutant form is said to vary in 

5 sequence from both the wild-type gene and the first mutant form of the gene. 

The term "liberating" as used herein refers to the release of a nucleic acid 
fragment from a larger nucleic acid fragment, such as an oligonucleotide, by the action 
of, for example, a 5' nuclease such that the released fragment is no longer covalently 
attached to the remainder of the oUgonucleotide. 

1 0 The term "Km" as used herein refers to the Michaelis-Menten constant for an 

enzyme and is defined as the concentration of the specific substrate at which a given 
enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction. 

The term "nucleotide" as used herein includes, but is not lunited to, naturally 
occurring and/or synthetic nucleotides, nucleotide analogs, and nucleotide derivatives. 

1 5 For example, the term includes naturally occurring DNA or RNA monomers, nucleotides 
with backbone modifications such as peptide nucleic acid (PNA) (M. Egholm et al., 
Nature 365:566 [1993]), phosphorothioate DNA, phosphorodithioate DNA, 
phosphoramidate DNA, aminde-linked DNA, MMI-linked DNA, 2'-0-methyl RNA, 
alpha-DNA and methylphosphonate DNA, nucleotides with sugar modifications such as 

20 2'-0-methyl RNA, 2'-fluoro RNA, 2'-amino RNA, 2'-0-alkyl DNA, 2'-0-allyl DNA, 
2'-0-alkynyl DNA, hexose DNA, pyranosyl RNA, and anhydrohexitol DNA, and 
nucleotides having base modifications such as C-5 substituted pyrimidines (substituents 
including fluoro-, bromo- chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, ethynyl-, 
propynyl-, alkynyl-, thiazoyl-, imidazolyl-, pyridyl-), 7-deazapurines with C-7 

25 substituents including fluoro-, bromo-, chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, 
alkynyl-, alkenyl-, thiazolyl-, imidazolyl-, pyridyl-), inosine and diaminopurine. 

The term "base analog" as used herein refers to modified or non-naturally 
occurring bases such as 7-deaza purines (e.g., 7-deaza-adenine and 7-deaza-guanine); 
bases modified, for example, to provide altered interactions such as non-standard 

30 basepairing, including, but not limited to: IsoC, Iso G, and other modified bases and 
nucleotides described in U.S. Patent Nos. 5,432,272; 6,001,983; 6,037,120; 6,140,496; 
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5,912,340; 6,127,121 and 6,143,877, each of which is incorporated herein by reference in 
their entireties; heterocychc base analogs based no the purine or pyrimidine ring systems, 
and other heterocyclic bases. Nucleotide analogs include base analogs and comprise 
modified forms of deoxyribonucleotides as well as ribonucleotides, 
5 The term "polymorphic locus" is a locus present in a population that shows 

variation between members of the population (e.g.., the most common allele has a 
frequency of less than 0.95). In contrast, a "monomorphic locus" is a genetic locus at 
Uttle or no variations seen between members of the population (generally taken to be a 
locus at which the most common allele exceeds a frequency of 0.95 in the gene pool of 
10 the population). 

The term "microorganism" as used herein means an organism too small to be 
observed with the unaided eye and includes, but is not limited to bacteria, virus, 
protozoans, fungi, and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
15 microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites 
incapable of autonomous replication (i.e., repUcation requires the use of the host cell's 
20 machinery). 

The term "multi-drug resistant" or multiple-drug resistant" refers to a 
microorganism that is resistant to more than one of the antibiotics or antimicrobial agents 
used in the treatment of said microorganism. 

The term "sample" in the present specification and claims is used in its broadest 
25 sense. On the one hand it is meant to include a specimen or culture (e.g., microbiological 
cultures). On the other hand, it is meant to include both biological and environmental 
samples. A sample may include a specimen of synthetic origin. 

Biological samples may be animal, including human, fluid, solid (e.g., stool) or 
tissue, as well as liquid and solid food and feed products and ingredients such as dairy 
30 items, vegetables, meat and meat by-products, and waste. Biological samples may be 
obtained from all of the various famihes of domestic animals, as well as feral or wild 
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animals, including, but not limited to, such animals as ungulates, bear, fish, lagamorphs, 
rodents, etc. 

Environmental samples include environmental material such as surface matter, 
soil, water and industrial samples, as well as samples obtained from food and dairy 

5 processing instruments, apparatus, equipment, utensils, disposable and non-disposable 
items. These examples are not to be construed as limiting the sample types apphcable to 
the present invention. 

The term "source of target nucleic acid" refers to any sample that contains nucleic 
acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 

10 samples including, but not limited to blood, saliva, cerebral spinal fluid, pleural fluid, 
milk, lymph, sputum and semen. 

An oligonucleotide is said to be present in "excess" relative to another 
oligonucleotide (or target nucleic acid sequence) if that oUgonucleotide is present at a 
higher molar concentration that the other oligonucleotide (or target nucleic acid 

15 sequence). When an oUgonucleotide such as a probe oligonucleotide is present in a 
cleavage reaction in excess relative to the concentration of the complementary target 
nucleic acid sequence, the reaction may be used to indicate the amount of the target 
nucleic acid present. Typically, when present in excess, the probe oligonucleotide will be 
present at least a 100-fold molar excess; typically at least 1 pmole of each probe 

20 oligonucleotide would be used when the target nucleic acid sequence was present at 
about 10 fmoles or less. 

A sample "suspected of containing" a first and a second target nucleic acid may 
contain either, both or neither target nucleic acid molecule. 

The term "charge-balanced" oligonucleotide refers to an oligonucleotide (the 

25 input oUgonucleotide in a reaction) that has been modified such that the modified 

oUgonucleotide bears a charge, such that when the modified oUgonucleotide is either 
cleaved (i.e., shortened) or elongated, a resulting product bears a charge different from 
the input oUgonucleotide (the "charge-unbalanced" oligonucleotide) thereby permitting 
separation of the input and reacted oUgonucleotides on the basis of charge. The term 

30 "charge-balanced" does not imply that the modified or balanced oligonucleotide has a net 
neutral charge (although this can be the case). Charge-balancing refers to the design and 
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modification of an oligonucleotide such that a specific reaction product generated from 
this input oligonucleotide can be separated on the basis of charge from the input 
oUgonucleotide. 

For example, in an INVADER oligonucleotide-directed cleavage assay in which 
the probe oligonucleotide bears the sequence: 5' TTCTTTTCACCAGCGAGACGGG 3' 
{i.e., SEQ ID NO: 136 without the modified bases) and cleavage of the probe occurs 
between the second and third residues, one possible charge-balanced version of this 
oligonucleotide would be: 5' Cy3-AminoT-Amino-TCTTTTCACCAGCGAGAC GGG 
3'. This modified oligonucleotide bears a net negative charge. After cleavage, the 
following oligonucleotides are generated: 5' Cy3-AminoT-Amino-T 3'and 5' 
CTTTTCACCAGCGAGACGGG 3' (residues 3-22of SEQ ID NO: 136). 5' Cy3- 
AminoT-Amino-T 3' bears a detectable moiety (the positively-charged Cy3 dye) and two 
Mnino-modified bases. The amino-modified bases and the Cy3 dye contribute positive 
charges in excess of the negative charges contributed by the phosphate groups and thus 
the 5' Cy3-AminoT-Amino-T 3'oligonucleotide has a net positive charge. The other, 
longer cleavage fragment, like the input probe, bears a net negative charge. Because the 
5' Cy3-AminoT-Amino-T 3'fragment is separable on the basis of charge from the input 
probe (the charge-balanced oligonucleotide), it is referred to as a charge-unbalanced 
oligonucleotide. The longer cleavage product cannot be separated on the basis of charge 
from the input oligonucleotide as both oUgonucleotides bear a net negative charge; thus, 
the longer cleavage product is not a charge-unbalanced oUgonucleotide. 

The term "net neufral charge" when used in reference to an ohgonucleotide, 
includmg modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction or separation conditions is essentially 
zero. An oUgonucleotide having a net neutral charge would not migrate in an electrical 
field. 

The term "net positive charge" when used in reference to an oUgonucleotide, 
including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction conditions is +1 or greater. An 
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oligonucleotide having a net positive charge would migrate toward the negative electrode 

in an electrical field. 

The term "net negative charge" when used in reference to an oUgonucleotide, 

including modified ohgonucleotides, indicates that the sum of the charges present (i.e., R- 
5 NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 

phosphate groups, etc.) under the desired reaction conditions is -1 or lower. An 

oligonucleotide having a net negative charge would migrate toward the positive electrode 

in an electrical field. 

The term "polymerization means" or "polymerization agent" refers to any agent 
1 0 capable of facilitating the addition of nucleoside triphosphates to an oligonucleotide. 

Preferred polymerization means comprise DNA and RNA polymerases. 

The term "ligation means" or "ligation agent" refers to any agent capable of 

faciUtating the Hgation (i.e., the formation of a phosphodiester bond between a 3'-0H and 

a 5' P located at the termini of two strands of nucleic acid). Preferred ligation means 
1 5 comprise DNA Ugases and RNA Hgases. 

The term "reactant" is used herein in its broadest sense. The reactant can 

comprise, for example, an enzymatic reactant, a chemical reactant or Hght (e.g., 

ultraviolet light, particularly short wavelength ultraviolet light is known to break 

oUgonucleotide chains). Any agent capable of reacting with an oUgonucleotide to either 
20 shorten (i.e., cleave) or elongate the oligonucleotide is encompassed within the term 

"reactant." 

The term "adduct" is used herein in its broadest sense to indicate any compound 
or element that can be added to an oUgonucleotide. An adduct may be charged 
(positively or negatively) or may be charge-neutral. An adduct may be added to the 

25 oligonucleotide via covalent or non-covalent linkages. Examples of adducts include, but 
are not Umited to, indodicarbocyanine dye amidites, amino-substituted nucleotides, 
ethidium bromide, ethidium homodimer, (l,3-propanediamino)propidium, 
(diethylenetriamino)propidium, thiazole orange, (N-N'-tetramethyl-l,3- 
propanediamino)propyl thiazole orange, (N-N'-tetramethyl-l,2-ethanediamino)propyl 

30 thiazole orange, thiazole orange-thiazole orange homodimer (TOTO), thiazole orange- 
thiazole blue heterodimer (TOTAB), thiazole orange-ethidium heterodimer 1 (TOEDl), 
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thiazole orange-ethidium heterodimer 2 (T0ED2) and fluorescein-ethidium heterodimer 
(FED), psoralens, biotin, streptavidin, avidin, etc. 

Where a first oligonucleotide is complementary to a region of a target nucleic acid 
and a second oligonucleotide has complementary to the same region (or a portion of this 
5 region) a "region of sequence overlap" exists along the target nucleic acid. The degree of 
overlap will vary depending upon the nature of the complementarity (see, e.g., region "X" 
in Figs. 29 and 67 and the accompanying discussions). 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample. For example, recombinant CLEAVASE nucleases are 
10 expressed in bacterial host cells and the nucleases are purified by the removal of host cell 
proteins; the percent of these recombinant nucleases is thereby increased in the sample. 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
that comprises of segments of DNAjoined together by means of molecular biological 
techniques. 

1 5 The term "recombinant protein" or "recombinant polypeptide" as used herein 

refers to a protein molecule that is expressed from a recombinant DNA molecule. 

As used herein the term "portion" when in reference to a protein (as in "a portion 
of a given protein") refers to fragments of that protein. The fragments may range in size 
from four amino acid residues to the entire amino acid sequence minus one amino acid 

20 (e.g.,4,5,6,...,n-l). 

The term "nucleic acid sequence" as used herein refers to an oligonucleotide, 
nucleotide or polynucleotide, and fragments or portions thereof, and to DNA or RNA of 
genomic or synthetic origin that may be single or double stranded, and represent the 
sense or antisense strand. Similarly, "amino acid sequence" as used herein refers to 

25 peptide or protein sequence. 

The term "peptide nucleic acid" ("PNA") as used herein refers to a molecule 
comprising bases or base analogs such as would be found in natural nucleic acid, but 
attached to a peptide backbone rather than the sugar-phosphate backbone typical of 
nucleic acids. The attachment of the bases to the peptide is such as to allow the bases to 

30 base pair with complementary bases of nucleic acid in a manner similar to that of an 

oligonucleotide. These small molecules, also designated anti gene agents, stop transcript 
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elongation by binding to their complementary strand of nucleic acid (Nielsen, et al. 
Anticancer Drug Des. 8:53 63 [1993]). 

As used herein, the terms "purified" or "substantially purified" refer to molecules, 
either nucleic or amino acid sequences, that are removed from their natural environment, 
5 isolated or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% free from other components with which they are naturally associated. An "isolated 
polynucleotide" or "isolated ohgonucleotide" is therefore a substantially purified 
polynucleotide. 

As used herein, the term "fiision protein" refers to a chimeric protein 
10 containing the protein of interest (e.g., CLEAVASE BN/thrombin nuclease and portions 

0 or fragments thereof) joined to an exogenous protein fragment (the fusion partner which 

1 consists of a non CLEAVASE BN/thrombin nuclease protem). The fusion partner may 
S enhance solubility of recombinant chimeric protein (e.g., the CLEAVASE BN/thrombin 
S nuclease) as expressed in a host cell, may provide an affinity tag (e.g., a his-tag) to allow 
W 1 5 purification of the recombinant fusion protein from the host cell or culture supernatant, or 

both. If desired, the fusion protein may be removed from the protein of interest (e.g., 
£ CLEAVASE BN/thrombin nuclease or fragments thereof) by a variety of enzymatic or 

chemical means known to the art. 
5 As used herein, the terms "chimeric protein" and "chimerical protein" refer to a 

^ 20 single protein molecule that comprises amino acid sequences portions derived from two 
or more parent proteins. These parent molecules may be from similar proteins from 
genetically distinct origins, different proteins from a single organism, or different 
proteins from different organisms. By way of example but not by way of limitation, a 
chimeric structure-specific nuclease of the present invention may contain a mixture of 
25 amino acid sequences that have been derived from FEN-1 genes from two or more of the 
organisms having such genes, combined to form a non-naturally occurring nuclease. The 
term "chimerical" as used herein is not intended to convey any particular proportion of 
contribution from the naturally occurring genes, nor Umit the manner in which the 
portions are combined. Any chimeric structure-specific nuclease constructs having 
30 cleavage activity as determined by the testing methods described herein are improved 
cleavage agents within the scope of the present invention. 
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The term "continuous strand of nucleic acid" as used herein is means a strand of 
nucleic acid that has a continuous, covalently linked, backbone structure, without nicks or 
other disruptions. The disposition of the base portion of each nucleotide, whether 
base-paired, single-stranded or mismatched, is not an element in the defmition of a 

5 continuous strand. The backbone of the continuous strand is not hmited to the 

ribose-phosphate or deoxyribose-phosphate compositions that are found in naturally 
occurring, unmodified nucleic acids. A nucleic acid of the present invention may 
comprise modifications in the structure of the backbone, including but not limited to 
phosphorothioate residues, phosphonate residues, 2' substituted ribose residues {e.g., 

10 2'-0-methyl ribose) and alternative sugar (e.g., arabinose) containing residues. 

The term "continuous duplex" as used herein refers to a region of double stranded 
nucleic acid in which there is no disruption in the progression of basepairs within the 
duplex (i.e., the base pairs along the duplex are not distorted to accommodate a gap, 
bulge or mismatch with the confines of the region of continuous duplex). As used herein 

1 5 the term refers only to the arrangement of the basepairs within the duplex, without 
implication of continuity in the backbone portion of the nucleic acid strand. Duplex 
nucleic acids with uninterrupted basepairing, but with nicks in one or both strands are 
within the definition of a continuous duplex. 

The term "duplex" refers to the state of nucleic acids in which the base portions of 

20 the nucleotides on one strand are bound through hydrogen bonding the their 

complementary bases arrayed on a second strand. The condition of being in a duplex 
form reflects on the state of the bases of a nucleic acid. By virtue of base pairing, the 
strands of nucleic acid also generally assume the tertiary structure of a double helix, 
having a major and a minor groove. The assumption of the helical form is implicit in the 

25 act of becoming duplexed. 

The term "duplex dependent protein binding" refers to the binding of proteins to 
nucleic acid that is dependent on the nucleic acid being in a duplex, or helical form. 

The term "duplex dependent protein binding sites or regions" as used herein refers 
to discrete regions or sequences within a nucleic acid that are bound with particular 

30 affinity by specific duplex-dependent nucleic acid binding proteins. This is in contrast to 
the generalized duplex-dependent binding of proteins that are not site-specific, such as 



51 



the histone proteins that bind chromatin with Httle reference to specific sequences or 
sites. 

The term "protein-binding region" as used herein refers to a nucleic acid region 
identified by a sequence or structure as binding to a particular protein or class of proteins. 

5 It is within the scope of this definition to include those regions that contain sufficient 
genetic information to allow identifications of the region by comparison to known 
sequences, but which might not have the reqtiisite structure for actual binding {e.g., a 
single strand of a duplex-depending nucleic acid binding protein site). As used herein 
"protein binding region" excludes restriction endonuclease binding regions. 

1 0 The term "complete double stranded protein binding region" as used herein refers 

to the minimum region of continuous duplex required to allow binding or other activity of 
a duplex-dependent protein. This definition is intended to encompass the observation 
that some duplex dependent nucleic acid binding proteins can interact with fiiU activity 
with regions of duplex that may be shorter than a canonical protein binding region as 

1 5 observed in one or the other of the two single strands. In other words, one or more 
nucleotides in the region may be allowed to remain unpaired without suppressing 
binding. As used here in, the term "complete double stranded binding region" refers to 
the minimum sequence that will accommodate the binding function. Because some such 
regions can tolerate non-duplex sequences in multiple places, although not necessarily 

20 simultaneously, a single protein binding region might have several shorter sub-regions 
that, when duplexed, will be fully competent for protein binding. 

The term "template" refers to a strand of nucleic acid on which a complementary 
copy is built from nucleoside triphosphates through the activity of a template-dependent 
nucleic acid polymerase. Within a duplex the template strand is, by convention, depicted 

25 and described as the "bottom" strand. Similarly, the non-template strand is often depicted 
and described as the "top" strand. 

The term "template-dependent RNA polymerase" refers to a nucleic acid 
polymerase that creates new RNA strands through the copying of a template strand as 
described above and which does not synthesize RNA in the absence of a template. This 

30 is in contrast to the activity of the template-independent nucleic acid polymerases that 
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synthesize or extend nucleic acids without reference to a template, such as terminal 
deoxynucleotidyl transferase, or Poly A polymerase. 

The term " ARRESTOR molecule" refers to an agent added to or included in an 
invasive cleavage reaction in order to stop one or more reaction components from 
participating in a subsequent action or reaction. This may be done by sequestering or 
inactivating some reaction component (e.g., by binding or base-pairing a nucleic acid 
component, or by binding to a protein component). The term "ARRESTOR 
oligonucleotide" refers to an oUgonucleotide included in an invasive cleavage reaction in 
order to stop or arrest one or more aspects of any reaction (e.g., the first reaction and/or 
any subsequent reactions or actions; it is not intended that the ARRESTOR 
oUgonucleotide be limited to any particular reaction or reaction step). This may be done 
by sequestering some reaction component (e.g., base-pairing to another nucleic acid, or 
binding to a protein component). However, it is not intended that the term be so limited 
as to just situations in which a reaction component is sequestered. 

As used herein, the term "kit" refers to any dehvery system for deUvering 
materials. In the context of reaction assays, such delivery systems include systems that 
allow for the storage, transport, or delivery of reaction reagents (e.g., oUgonucleotides, 
enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, 
written instructions for performing the assay etc.) from one location to another. For 
example, kits include one or more enclosures (e.g., boxes) containing the relevant 
reaction reagents and/or supporting materials. As used herein, the term "fragmented kif 
refers to a delivery systems comprising two or more separate containers that each contain 
a subportion of the total kit components. The containers may be deHvered to the intended 
recipient together or separately. For example, a first container may contain an enzyme 
for use in an assay, while a second container contains oligonucleotides. The term 
"fragmented kit" is intended to encompass kits containing Analyte specific reagents 
(ASR's) regulated under section 520(e) of the Federal Food, Drug, and Cosmetic Act, but 
are not limited thereto. Indeed, any delivery system comprising two or more separate 
containers that each contains a subportion of the total kit components are included in the 
term "fragmented kit." In contrast, a "combined kit" refers to a dehvery system 
containing all of the components of a reaction assay in a single container (e.g., in a single 
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box housing each of the desired components). The term "kit" includes both fragmented 
and combined kits. 

As used herein, the term "functional domain" refers to a region, or a part of a 
region, of a protein (e.g., an enzyme) that provides one or more functional characteristic 
5 of the protein. For example, a functional domain of an enzyme may provide, directly or 
indirectly, one or more activities of the enzyme including, but not limited to, substrate 
binding capabihty and catalytic activity. A functional domain may be characterized 
through mutation of one or more amino acids within the functional domain, wherein 
mutation of the amino acid(s) alters the associated functionality (as measured empirically 

10 in an assay) thereby indicating the presence of a functional domain. 

As used herein, the term "heterologous functional domain" refers to a protein 
functional domain that is not in its natural environment. For example, a heterologous 
functional domain includes a functional domain from one enzyme introduced into another 
enzyme. A heterologous functional domain also includes a functional domain native to a 

1 5 protein that has been altered in some way (e.g., mutated, added in multiple copies, etc.). 
A heterologous functional domain may comprise a plurality of contiguous amino acids or 
may include two or more distal amino acids are amino acids fragments (e.g., two or more 
amino acids or fragments with intervening, non-heterologous, sequence). Heterologous 
functional domains are distinguished from endogenous functional domains in that the 

20 heterologous amino acid(s) are joined to or contain amino acid sequences that are not 

found naturally associated with the amino acid sequence in nature or are associated with a 
portion of a protein not found in nature. 

As used herein, the term "altered functionality in a nucleic acid cleavage assay" 
refers to a characteristic of an enzyme that has been aUered in some manner to differ from 

25 its natural state (e.g., to differ from how it is found in nature). Alterations include, but 

are not limited to, addition of a heterologous functional domain (e.g., through mutation or 
through creation of chimerical proteins). In some embodiments, the altered characteristic 
of the enzyme may be one that improves the performance of an enzyme in a nucleic acid 
cleavage assay. Types of improvement include, but are not limited to, improved nuclease 

30 activity (e.g., improved rate of reaction), improved substrate binding (e.g., increased or 
decreased binding of certain nucleic acid species [e.g., RNA or DNA] that produces a 
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desired outcome [e.g., greater specificity, improved substrate turnover, etc.]), and 
improved background specificity (e.g., less undesired product is produced). The present 
invention is not limited by the nucleic cleavage assay used to test improved functionality. 
However, in some preferred embodiments of the present invention, an invasive cleavage 

5 assay is used as the nucleic acid cleavage assay. In certain particularly preferred 

embodiments, an invasive cleavage assay utilizing an RNA target is used as the nucleic 
acid cleavage assay. 

As used herein, the terms "N-terminal" and "C-terminal" in reference to 
polypeptide sequences refer to regions of polypeptides including portions of the N- 

10 terminal and C-terminal regions of the polypeptide, respectively. A sequence that 
includes a portion of the N-terminal region of polypeptide includes amino acids 
predominantly from the N-terminal half of the polypeptide chain, but is not limited to 
such sequences. For example, an N-terminal sequence may include an interior portion of 
the polypeptide sequence including bases from both the N-terminal and C-terminal halves 

15 of the polypeptide. The same apphes to C-terminal regions. N-terminal and C-terminal 
regions may, but need not, include the amino acid defining the ultimate N-terminal and 
C-terminal ends of the polypeptide, respectively. 

DESCRIPTION OF THE DRAWINGS 

20 Figure 1 shows a schematic representation of sequential invasive cleavage 

reactions. In step A, an upstream INVADER oligonucleotide and a downsfream probe 
combine with a target nucleic acid strand to form a cleavage structure. In step B, the 
portion of the cleaved signal probe from A combines with a second target nucleic acid 
strand and a labeled signal probe to form a second cleavage structure. In step C, cleavage 

25 of the labeled second cleavage structure yields a detectable signal. 

Figure 2 shows schematic representations of several examples of invasive 
cleavage structures comprising RNA target strands (SEQ ID NO: 141). Panel A depicts 
an INVADER oligonucleotide (SEQ ID NO:142) and probe (SEQ ID NO:143). Panel B 
depicts an INVADER oligonucleotide (SEQ ID NO: 144) and probe (SEQ ID NO: 143). 

30 Panel C depicts an INVADER oligonucleotide (SEQ ID NO: 145) and probe (SEQ ID 
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NO:145). Panel D depicts an INVADER oligonucleotide (SEQ ID NO:145) and probe 
(SEQ ID NO: 146). 

Figure 3 shows schematic representations of two examples of structures that are 
not invasive cleavage structures labelled SEQ ID NOs:147-152. 

5 Figure 4 shows a schematic representation of a configuration of invasive cleavage 

that is useful for detection of target sequence variations. In A, an invasive cleavage 
structure having overlap between the two probes is formed, and the arrow indicates that it 
is cleavable by the enzymes of the present invention. In B, variation of the target 
sequence removes a region of complementarity to the downstream probe and eliminates 

1 0 the overlap. The absence of an arrow in panel B indicates a reduced rate of cleavage of 
this structure compared to that diagrammed in panel A. 

Figure 5 shows a diagram of the X-ray structure of a ternary complex of Klentaql 
with primer/template DNA in the polymerizing mode determined by Li et al (Li et al. 
Protein Sci., 7:1 116 [1998]), Without intending to represent precise borders between 

15 features of the physical form, the portions referred to in the text as the "fingers", "thumb" 
and "palm" regions are loosely indicated by the circle, rectangle, and oval, respectively. 

Figure 6 shows a schematic diagram of the DNA polymerase gene from Thermus 
aquaticus. Restriction sites used in these studies are indicated above. The approximate 
regions encoding various structural or functional domains of the protein are indicated by 

20 double-headed arrows, below. 

Figure 7 shows a schematic diagram of the chimeric constructs comprising 
portions of the TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol 
and TthPol sequences, respectively. The numbers correspond to the amino acid sequence 
of TaqPol. The 5' nuclease and polymerase domains of TaqPol and the palm, thumb, and 

25 fingers regions of the polymerase domain are indicated. The abbreviations for the 

restrictions sites used for recombination are as follows: E, EcoRI; N, NotI; Bs, BstBI; D, 
Ndel; B, BamHI; and S, Sail. 

Figure 8A-H shows a comparison of the nucleotide structure of the polymerase 
genes isolated from Thermus aquaticus (SEQ ID NO: 153), Thermus flavus (SEQ ID 

30 NO: 154) and Thermus thermophilus (SEQ ID NO: 155); the consensus sequence (SEQ ID 
NO: 156) is shown at the top of each row. 
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Figure 9A-C shows a comparison of the amino acid sequence of the polymerase 
isolated from Thermus aquaticus (SEQ ID NO:157), Thermus flavus (SEQ ID NO:158), 
and Thermus thermophilus (SEQ ID N0:1); the consensus sequence (SEQ ID NO: 159) is 
shown at the top of each row. 

Figure 10 shows the sequences and proposed structures of substrates for the 
invasive signal amplification reaction with human IL-6 RNA target strand (SEQ ID 
NO:160) and upstream probe (SEQ ID N0:161). The cleavage site of the downstream 
probe (SEQ ID NO: 162) is indicated by an arrow. Sequence of the IL-6 DNA target 
strand (SEQ ID NO:163) is shown below. 

Figure 1 1 shows the image generated by a fluorescence imager showing the 
products of invasive cleavage assays using the indicated enzymes, and the IL-6 substrate 
of Figure 10 having either a DNA target strand (A) or an RNA target strand (B). 

Figure 12 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 
RX HT, and Taq-Tth chimerical enzymes with IL-6 substrate having an RNA target 
strand. 

Figure 13 shows a comparison of the amino acid sequences of the Bstl-BamHI 
fragments of TaqPol (SEQ ID NO:164) and TthPol (SEQ ID NO:165). Pairs of similar 
amino acids are shaded with light gray. AHgned amino acids that have a charge 
difference are shaded with dark gray. The numbers correspond to the amino acid 
sequence of TaqPol. Amino acids of TaqPol changed to the corresponding amino acids 
of TthPol by site-directed mutagenesis are indicated by (+). 

Figure 14 compares the cycling cleavage activities of Taq DN RX HT, Taq-Tth 
chimerical enzymes, and chimerical enzymes having the indicated additional amino acid 
modifications, with IL-6 substrate having an RNA target strand. 

Figure 15 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 
RX HT, and Taq DN RX HT having the indicated amino acid modifications, with IL-6 
substrate having an RNA target strand. 

Figure 16 compares polymerization activities of TaqPol, TthPol, and Taq-Tth 
chimerical enzymes, and TaqPol having the indicated amino acid modifications. 
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Figure 17 shows a diagram of the X-ray structure of a ternary complex of 
Klentaql with primer/template DNA in the polymerizing mode determined by Li et al. 
(Li et al. Protein Sci., 7:1 1 16 [1998]). Amino acids G418 and E507 are indicated. 

Figures 18 A-D show schematic diagrams of examples of substrates that maybe 

5 used to measure various cleavage activities of enzymes. The substrates may be labeled, 
for example, with a fluorescent dye and a quenching moiety for FRET detection, as 
shown, to facilitate detection and measurement. The substrates of 18A and 18B are 
invasive cleavage structures having RNA and DNA target strands, respectively. 1 8C 
shows an example of an X-structure, and 18D shows an example of a hairpin structure, 

1 0 both of which may be used to assess the activity of enzymes on alternative structures that 
may be present in invasive cleavage reactions. 

Figure 19 shows schematic diagrams of chimeric constructs comprising portions 
of the TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol and 
TthPol sequences, respectively. The chimeras also include the DN, RX, and HT 

15 modifications. A table compares the cleavage activity of each protein on the indicated 
cleavage substrates. 

Figure 20A shows a schematic diagram for an RNA containing invasive cleavage 
substrate. The 5' end of the target molecule (SEQ ID NO:166) is modified with biotin 
and blocked with streptavidm as described. The downstream probe (SEQ ID NO: 167) 
20 with cleavage site is also shown. Panels B-D show analysis of the properties of the Taq 
DN RX HT G418K/E507Q mutant in cleavage of the shown substrate under conditions 
of varying reaction temperature, KCl concentration, and MgS04 concentration. 

Figure 21 shows schematic diagrams for model substrates used to test enzymes 
for invasive cleavage activity. The molecule shown in 21 A provides a DNA target strand 
25 (SEQ ID NO: 1 68), while the model shown in 21B provides an RNA containing target 
strand (SEQ ID NO:167). Both 21A and B show downstream probe SEQ ID NO:166. 

Figure 22 shows schematic diagrams for model substrates used to test enzymes 
for cleavage activity on alternative, non-invasive structures. 

Figure 23 shows a schematic diagram for a model substrate used to test enzymes 
30 for invasive cleavage activity. 
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Figure 24 shows schematic diagrams for a model substrate used to test enzymes 
for invasive cleavage activity on RNA or DNA target strands. 

Figure 25 compares the cycling cleavage activities of Tth DN RX HT, Taq 2M, 
TfiPol, Tsc Pol, and Tfi and Tsc-derived mutant enzymes. 
5 Figure 26 depicts structures that may be employed to determine the ablity of an 

enzyme to cleave a probe in the presence and the absence of an upstream oligonucleotide. 
Fig. 26 displays the sequence of oligonucleotide 89-15-1 (SEQ ED NO:212), 
oligonucleotide 81-69-5 (SEQ ID NO:213), oHgonucleotide 81-69-4 (SEQ ID NO:214), 
ohgonucleotide 81-69-3 (SEQ ID NO:215), ohgonucleotide 81-69-2 (SEQ ID NO:216) 
10 and a portion of M13mpl8 (SEQ ID NO:217). 

Figure 27 shows the image generated by a fluorescence imager that shows the 
dependence of Pfu FEN-1 on the presence of an overlapping upstream ohgonucleotide for 
specific cleavage of the probe. 

Figure 28a shows the image generated by a fluorescence imager that compares the 
15 amount of product generated in a standard (i.e., a non-sequential invasive cleavage 
reaction) and a sequential invasive cleavage reaction. 

Figure 28b is a graph comparing the amount of product generated in a standard or 
basic {i.e., a non-sequential invasive cleavage reaction) and a sequential invasive 
cleavage reaction ("INVADER sqrd") (y axis = fluorescence units; x axix = attomoles of 
20 target). 

Figure 29 shows the image generated by a fluorescence imager that shows that the 
products of a completed sequential invasive cleavage reaction cannot cross contaminant a 
subsequent similar reaction. 

Figure 30 shows the sequence of the oligonucleotide employed in an invasive 
25 cleavage reaction for the detection of HCMV viral DNA; Fig. 30 shows the sequence of 
oligonucleotide 89-76 (SEQ ID NO:218), ohgonucleotide 89-44 (SEQ ID NO:219) and 
nucleotides 3057-3 110 of the HCMV genome (SEQ ID NO:220). 

Figure 31 shows the image generated by a fluorescence imager that shows the 
sensitive detection of HCMV viral DNA in samples containing human genomic DNA 
30 using an invasive cleavage reaction. 
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Figure 32 is a schematic that illustrates one embodiment of the present invention, 
where the cut probe from an initial invasive cleavage reaction is employed as the 
INVADERoligonucleotide in a second invasive cleavage reaction, and where an 
ARRESTOR oligonucleotide prevents participation of remaining uncut first probe in the 
5 cleavage of the second probe. 

Figure 33 is a schematic that illustrates one embodiment of the present invention, 
where the cut probe from an initial invasive cleavage reaction is employed as an 
integrated INVADER-target complex in a second invasive cleavage reaction, and where 
an ARRESTOR oligonucleotide prevents participation of remaining uncut first probe in 
1 0 the cleavage of the second probe. 

Figure 34 shows three images generated by a fluorescence imager showing that 
two different lengths of 2^ 0-methyl, 3^ terminal amine-modified ARRESTOR 
oUgonucleotide both reduce non-specific background cleavage of the secondary probe 
when included in the second step of a reaction where the cut probe from an initial 
1 5 invasive cleavage reaction is employed as an integrated INVADER-target complex in a 
second invasive cleavage reaction. 

Figure 35A shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction where the cut probe from an initial invasive 
20 cleavage reaction is employed as the INVADER oligonucleotide in a second invasive 
cleavage reaction. 

Figure 35B shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction, and inclusion of a 2' 0-methyl, 3' terminal 

25 amine-modified ARRESTOR oHgonucleotide in the second step of a reaction where the 
cut probe from an initial invasive cleavage reaction is employed as the INVADER 
oligonucleotide in a second invasive cleavage reaction. 

Figure 35C shows shows a graph generated using the spreadsheet MICROSOFT 
EXCEL software, comparing the effects on nonspecific and specific cleavage signal of 

30 increasing concentrations of primary probe in the first step of a reaction, in the presence 
or absence of a T 0-methyl, 3' terminal amine-modified ARRESTOR oHgonucleotide in 



the second step of a reaction where the cut probe from an initial invasive cleavage 
reaction is employed as the INVADER oligonucleotide in a second invasive cleavage 
reaction. 

Figure 36A shows two images generated by a fluorescence imager showing the 
5 effects on nonspecific and specific cleavage signal of including an unmodified 

ARRESTOR oligonucleotide in the second step of a reaction where the cut probe from an 
initial invasive cleavage reaction is employed as the INVADER oligonucleotide in a 
second invasive cleavage reaction. 

Figure 36B shows two images generated by a fluorescence imager showing the 
1 0 effects on nonspecific and specific cleavage signal of including a 3' terminal amine 
O modified ARRESTOR ohgonucleotide, a partially 2' 0-methyl substituted, 3' terminal 

m amine modified ARRESTOR oligonucleotide, or an entirely 2' 0-methyl, 3' terminal 

5 amine modified ARRESTOR ohgonucleotide in the second step of a reaction where the 

P cut probe from an initial invasive cleavage reaction is employed as the INVADER 

m 15 ohgonucleotide in a second invasive cleavage reaction. 

Figure 37A shows two images generated by a fluorescence imager comparing the 
fi effects on nonspecific and specific cleavage signal of inchiding ARRESTOR 

oligonucleotides of different lengths in the second step of a reaction where the cut probe 
O from an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 

20 in a second invasive cleavage reaction. 

Figure 37B shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including an arrestoer 
ohgonucleotides of different lengths in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER ohgonucleotide 
25 in a second invasive cleavage reaction, and in which a longer variant of the secondary 
probe used in the reactions in Fig. 37A is tested. 

Figure 37C shows a schematic diagram of a primary probe aligned with several 
ARRESTOR oligonucleotides of different lengths. The region of the primary probe that 
is complementary to the HBV target sequence is underlined. The ARRESTOR 
30 oligonucleotides are aligned with the probe by complementarity. 
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Figure 38 shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including ARRESTOR 
oligonucleotides of different lengths in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER oHgonucIeotide 
in a second invasive cleavage reaction, using secondary probes of two different lengths. 

Figure 39 shows a graph of the calculated running average of a ten nucleotide 
stretch of the hUbiquitin RNA (the Ave(lO) Index) derived from the SS-count output of 
an mfold analysis, expressed as a percentage of the total number of structures found by 
mfold that include a particular base, plotted against the position of the base. 

Figure 40 shows an example microplate layout for an RNA INVADER assay 
comprising 40 samples, 6 standards, and a No Target Control. 

Figure 41 shows INVADER assay components for use in detecting human (h), 
mouse (m), or rat (r) RNAs of the indicated genes or franscripts. 

Figure 42 shows a computer display of an INVADERCREATOR Order Entry 

screen. 

Figure 43 shows a computer display of an INVADERCREATOR Multiple SNP 
Design Selection screen. 

Figure 44 shows a computer display of an INVADERCREATOR Designer 
Worksheet screen. 

Figure 45 shows a computer display of an INVADERCREATOR Output Page 

screen. 

Figure 46 shows a computer display of an INVADERCREATOR Printer Ready 
Output screen. 

Figure 47 shows INVADER assay components (SEQ ED NOs:709-2640) for use 
in detecting RNA target nucleic acids. Components are grouped per RNA analyte to be 
detected. Where multiple probes, INVADER ohgonucleotides, stacker oligonucleotides, 
ARRESTOR oligonucleotides, or other components are provided, any of the multiple 
components may be used, unless indicated otherwise. Unless indicated otherwise, 
ohgonucleotides are presented 5 '-3' orientation. 

Figure 48 shows a chart showing the Ave(lO) hidex against base pair position. 
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DESCRIPTION OF THE INVENTION 
Introduction 

The present invention relates to methods and compositions for treating nucleic 
5 acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes. 

In preferred embodiments, the present invention relates to means for cleaving a 
nucleic acid cleavage structure in a site-specific manner. While the present invention 
provides a variety of cleavage agents, in some embodiments, the present invention relates 
10 to a cleaving enzyme having 5' nuclease activity without interfering nucleic acid 
synthetic ability. In other embodiments, the present invention provides novel 
polymerases {e.g., thermostable polymerases) possessing altered polymerase and/or 
nucleases activities. 

For example, in some embodiments, the present invention provides 5* nucleases 

1 5 derived from thermostable DNA polymerases that exhibit altered DNA synthetic activity 
from that of native thermostable DNA polymerases. The 5' nuclease activity of the 
polymerase is retained while the synthetic activity is reduced or absent. Such 5' 
nucleases are capable of catalyzing the structure-specific cleavage of nucleic acids in the 
absence of interfering synthetic activity. The lack of synthetic activity during a cleavage 

20 reaction resuUs in nucleic acid cleavage products of uniform size. 

The novel properties of the nucleases of the invention form the basis of a method 
of detecting specific nucleic acid sequences. This method reUes upon the amplification 
of the detection molecule rather than upon the amphfication of the target sequence itself 
as do existing methods of detecting specific target sequences. 

25 DNA polymerases (DNAPs), such as those isolated from E. coli or from 

thermophilic bacteria of the genus Thermus as well as other organisms, are enzymes that 
synthesize new DNA strands. Several of the known DNAPs contain associated nuclease 
activities in addition to the synthetic activity of the enzyme. 

Some DNAPs are known to remove nucleotides from the 5' and 3' ends of DNA 

30 chains (Komberg, DNA RepUcation, W.H, Freeman and Co., San Francisco, pp. 127-139 
[1980]). These nuclease activities are usually referred to as 5' exonuclease and 3' 



63 



exonuclease activities, respectively. For example, the 5' exonuclease activity located in 
the N-terminal domain of several DNAPs participates in the removal of RNA primers 
during lagging strand synthesis during DNA repUcation and the removal of damaged 
nucleotides during repair. Some DNAPs, such as the E. coli DNA polymerase 
5 (DNAPEcl), also have a 3' exonuclease activity responsible for proof-reading during 
DNA synthesis (Komberg, supra). 

A DNAP isolated from Thermus aquaticus, termed Taq DNA polymerase 
(DNAPTaq), has a 5' exonuclease activity, but lacks a functional 3' exonucleolytic 
domain (Tindall and Kunkell, Biochem., 27:6008 [1988]). Derivatives of DNAPEcl and 
10 DNAPTaq, respectively called the Klenow and Stoffel fragments, lack 5* exonuclease 
domains as a result of enzymatic or genetic manipulations (Brutlag et al, Biochem. 
f Biophys. Res. Commun., 37:982 [1969]; Erlich et al.. Science 252:1643 [1991]; Setlow 

5 and Komberg, J. Biol. Chem., 247:232 [1972]). 

The 5' exonuclease activity of DNAPTaq was reported to requfre concurrent 
W 1 5 synthesis (Gelfand, PGR Technology - Principles and Applications for DNA 
I ' Amplification, H.A. Erlich, [Ed.], Stockton Press, New York, p. 19 [1989]). Although 

y mononucleotides predominate among the digestion products of the 5' exonucleases of 

nj DNAPTaq and DNAPEcl, short oligonucleotides (< 12 nucleotides) can also be observed 

O implying that these so-called 5' exonucleases can function endonucleolytically (Setlow, 

^ 20 supra; Holland et al., Proc. Natl. Acad. Sci. USA 88:7276 [1991]). 

In WO 92/06200, Gelfand et al. show that the preferred substrate of the 5' 
exonuclease activity of the thermostable DNA polymerases is displaced single-stranded 
DNA. Hydrolysis of the phosphodiester bond occurs between the displaced single- 
stranded DNA and the double-helical DNA with the preferred exonuclease cleavage site 
25 being a phosphodiester bond in the double helical region. Thus, the 5' exonuclease 
activity usually associated with DNAPs is a structure-dependent single-stranded 
endonuclease and is more properly referred to as a 5' nuclease. Exonucleases are 
enzymes that cleave nucleotide molecules from the ends of the nucleic acid molecule. 
Endonucleases, on the other hand, are enzymes that cleave the nucleic acid molecule at 
30 internal rather than terminal sites. The nuclease activity associated with some 

thermostable DNA polymerases cleaves endonucleolytically but this cleavage requires 
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contact with the 5' end of the molecule being cleaved. Therefore, these nucleases are 
referred to as 5' nucleases. 

When a 5' nuclease activity is associated with a eubacterial Type A DNA 
polymerase, it is found in the one third N-terminal region of the protein as an independent 

5 functional domain. The C-terminal two-thirds of the molecule constitute the 

polymerization domain that is responsible for the synthesis of DNA. Some Type A DNA 
polymerases also have a 3' exonuclease activity associated with the two-third C-terminal 
region of the molecule. 

The 5' exonuclease activity and the polymerization activity of DNAPs can be 

1 0 separated by proteolytic cleavage or genetic manipulation of the polymerase molecule. 
The Klenow or large proteolytic cleavage fragment of DNAPEcl contains the 
polymerase and 3' exonuclease activity but lacks the 5* nuclease activity. The Stoffel 
fragment of DNAPTaq (DNAPStf) lacks the 5^ nuclease activity due to a genetic 
manipulation that deleted the N-terminal 289 amino acids of the polymerase molecule 

15 (Erhch et al, Science 252:1643 [1991]). WO 92/06200 describes a thermostable DNAP 
with an altered level of 5' to 3* exonuclease. U.S. Patent No. 5,108,892 describes a 
Thermus aquaticus DNAP without a 5' to 3' exonuclease. Thermostable DNA 
polymerases with lessened amounts of synthetic activity are available (Third Wave 
Technologies, Madison, WI) and are described in U.S. Pat. Nos. 5,541,311, 5,614,402, 

20 5,795,763, 5,691 , 142, and 5,837,450, herein incorporated by reference in their entireties. 
The present invention provides 5' nucleases derived from thermostable Type A DNA 
polymerases that retain 5' nuclease activity but have reduced or absent synthetic activity. 
The abiUty to uncouple the synthetic activity of the enzyme from the 5' nuclease activity 
proves that the 5' nuclease activity does not require concurrent DNA synthesis as was 

25 previously reported (Gelfand, PGR Technology, supra). 

In addition to the 5 '-exonuclease domains of the DNA polymerase I proteins of 
Eubacteria, described above, 5' nucleases have been found associated with bacteriophage, 
eukaryotes and archaebacteria. Overall, all of the enzymes in this family display very 
similar substrate specificities, despite their limited level of sequence similarity. 

30 Consequently, enzymes suitable for use in the methods of the present invention may be 
isolated or derived from a wide array of sources. 
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A mammalian enzyme with fonctional similarity to the 5'-exonuclease domain of 
E. coli Pol I was isolated nearly 30 years ago (Lindahl, et al, Proc Natl Acad Sci U S A 
62(2): 597-603 [1969]). Later, additional members of this group of enzymes called flap 
endonucleases (FENl ) from Eukarya and Archaea were shown to possess a nearly 

5 identical structure specific activity (Harrington and Lieber. Embo J 13(5), 1235-46 
[1994]; Murante et al. , J Biol Chem 269(2), 1 191-6 [1994]; Robins, et al, J Biol Chem 
269(46), 28535-8 [1994]; Hosfield, etal, J Biol Chem 273(42), 27154-61 [1998]), 
despite limited sequence similarity. The substrate specificities of the FENl enzymes, and 
the eubacterial and related bacteriophage enzymes have been examined and found to be 

10 similar for all enzymes (Lyamichev, et al. Science 260(5 109), 778-83 [1993], Harrington 
and Lieber, supra, Murante, et al, supra, Hosfield, et al supra, Rao, et al, J Bacteriol 
180(20), 5406-12 [1998], Bhagwat, et al,. J. Biol Chem 272(45), 28523-30 [1997], 
Garforth and Sayers, Nucleic Acids Res 25(19), 3801-7 [1997]). 

Using preformed substrates, many of the studies cited above determined that these 

1 5 nucleases leave a gap upon cleavage, leading the authors to speculate that DNA 

polymerase must then act to fill in that gap to generate a ligatable nick. A number of 
other 5' nucleases have been shown to leave a gap or overiap after cleavage of the same 
or similar flap substrates. It has since been determined that that all the structure-specific 
5'-exonucleases leave a nick after cleavage if the substrate has an overiap between the 

20 upstream and downstream duplexes (Kaiser et al, J. Biol .Chem. 274(30):21387-21394 
[1999]). While duplexes having several bases of overiapping sequence can assume 
several different conformations through branch migration, it was determined that 
cleavage occurs in the conformation where the last nucleotide at the 3' end of the 
upstream strand is unpaired, with the cleavage rate being essentially the same whether the 

25 end of the upstream primer is A, C, G, or T. It was determined to be positional overlap 
between the 3' end of the upstiream primer and downstream duplex, rather then sequence 
overlap, that provides optknal cleavage. In addition to allowing these enzymes to leave a 
nick after cleavage, the single base of overiap causes the enzymes to cleave several 
orders of magnitude faster than when a substrate lacks overlap (Kaiser et al, supra). 

30 Any of the 5' nucleases described below may find application in one or more 

embodiments of the methods described herein. 5' nucleases of particular utility in the 
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methods of present invention include but are not limited to polymerases from a Thermus 
species including, but not limited to, Thermus aquaticus, Thermus flavus, Thermus 
thermophilus, Thermus filiformus, and Thermus scotoductus, and altered polymerases. 
Particularly preferred are altered polymerases exhibiting improved performance in 

5 detection assays based on the cleavage of a DNA member of an invasive cleavage 
structure that comprises an RNA target strand. 

Chimerical polymerases may find application in one or more embodiments of the 
present invention, including but not hmited to chimerical polymerases comprising one or 
more portions of one or more FEN nucleases including but are not limited to those of 

10 Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus 
veneficus, Sulfolobus solfataricus, Pyrobaculum aerophilum, Thermococcus litoralis, 
Archaeaglobus profundus, Acidianus brierlyi, Acidianus ambivalens, Desulfurococcus 
amylolyticus, Desulfurococcus mobilis, Pyrodictium brockii, Thermococcus gorgonarius, 
Thermococcus zilligH Methanopyrus kandleri Methanococcus igneus, Pyrococcus 

1 5 horikoshii, and Aeropyrum pernix; particularly preferred FENl enzymes are chimerical 
Archaeoglobus fulgidus and Pyrococcus furiosus. Particularly preferred are altered 
polymerases exhibiting improved performance in detection assays based on the cleavage 
of a DNA member of an invasive cleavage structure that comprises an RNA target strand. 

20 The detailed description of the invention is presented in the following sections: 

L Detection of Specific Nucleic Acid Sequences Using 5' Nucleases in an 

INVADER Directed Cleavage Assay; 
IL Signal Enhancement By Incorporating The Products Of An Invasive Cleavage 
25 Reaction Into A Subsequent Invasive Cleavage Reaction; 

III. Effect of ARRESTOR Oligonucleotides on Signal and Background in Sequential 
Invasive Cleavage Reactions. 

IV. Improved Enzymes For Use In INVADER Ohgonucleotide-Directed Cleavage 
Reactions Comprising RNA Targets; 

30 V. Reaction Design for INVADER Assay Detection of RNA Targets; 
VL Kits for performing the RNA INVADER Assay; and 
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VIL The INVADER Assay for Direct Detection and Measurement of Specific RNA 
Analytes. 

5 1. Detection of Specific Nucleic Acid Sequences Using 5' Nucleases in an 
INVADER Directed Cleavage Assay 

1 . INVADER Assay Reaction design 

The present invention provides means for forming a nucleic acid cleavage 
10 structure that is dependent upon the presence of a target nucleic acid and cleaving the 
nucleic acid cleavage structure so as to release distinctive cleavage products. 5' nuclease 
activity, for example, is used to cleave the target-dependent cleavage structure and the 
resulting cleavage products are indicative of the presence of specific target nucleic acid 
sequences in the sample. When two strands of nucleic acid, or oUgonucleotides, both 
1 5 hybridize to a target nucleic acid strand such that they form an overlapping invasive 
cleavage structure, as described below, invasive cleavage can occur. Through the 
interaction of a cleavage agent (e.g., a 5' nuclease) and the upstream ohgonucleotide, the 
cleavage agent can be made to cleave the downstream oligonucleotide at an internal site 
in such a way that a distinctive fragment is produced. Such embodiments have been 
20 termed the INVADER assay (Third Wave Technologies) and are described in U.S. Patent 
Appl. Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, and 6,090,543 and PCT 
Pubhcations WO 97/27214 and WO 98/42873, herein incorporated by reference in their 
entireties. 

The present invention further provides assays in which the target nucleic acid is 
25 reused or recycled during multiple rounds of hybridization with oligonucleotide probes 
and cleavage of the probes without the need to use temperature cycling (i.e., for periodic 
denaturation of target nucleic acid strands) or nucleic acid synthesis (i.e., for the 
polymerization-based displacement of target or probe nucleic acid strands). When a 
cleavage reaction is run under conditions in which the probes are continuously replaced 
30 on the target strand (e.g. through probe-probe displacement or through an equilibrium 
between probe/target association and disassociation, or through a combination 
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comprising these mechanisms, [ Reynaldo et al., J. Mol. Biol. 97: 51 1 (2000)]) multiple 
probes can hybridize to the same target, allowing multiple cleavages, and the generation 
of multiple cleavage products. 

By the extent of its complementarity to a target nucleic acid strand, an 

5 oHgonucleotide may be said to define a specific region of the target. In an invasive 

cleavage structure, the two ohgonucleotides defme and hybridize to regions of the target 
that are adjacent to one another (i.e., regions without any additional region of the target 
between them). Either or both ohgonucleotides may comprise additional portions that are 
not complementary to the target strand, hi addition to hybridizing adjacently, in order to 

10 form an invasive cleavage structure, the 3' end of the upstream oHgonucleotide must 
comprise an additional moiety. When both ohgonucleotides are hybridized to a target 
strand to form a structure and such a 3' moiety is present on the upstream oligonucleotide 
within the structure, the ohgonucleotides may be said to overlap, and the structure may be 
described as an overlapping, or invasive cleavage structure. 

15 In one embodiment, the 3' moiety of the invasive cleavage structure is a single 

nucleotide. In this embodiment the 3' moiety may be any nucleotide {i.e., it may be, but 
it need not be complementary to the target strand). In a preferred embodiment the 3' 
moiety is a single nucleotide that is not complementary to the target strand. In another 
embodiment, the 3' moiety is a nucleotide-like compound (i.e., a moiety having chemical 

20 features similar to a nucleotide, such as a nucleotide analog or an organic ring compound; 
See e.g., U.S. Pat. No. 5,985,557). In yet another embodiment the 3' moiety is one or 
more nucleotides that duplicate in sequence one or more nucleotides present at the 5' end 
of the hybridized region of the downstream oligonucleotide. In a fiirther embodiment, 
the dupUcated sequence of nucleotides of the 3' moiety is followed by a single nucleotide 

25 that is not further dupUcative of the downstream oHgonucleotide sequence, and that may 
be any other nucleotide, hi yet another embodiment, the duplicated sequence of 
nucleotides of the 3' moiety is foUowed by a nucleotide-Hke compound, as described 
above. 

The downstream oligonucleotide may have, but need not have, additional moieties 
30 attached to either end of the region that hybridizes to the target nucleic acid strand. In a 
preferred embodiment, the downstream oligonucleotide comprises a moiety at its 5' end 
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{i.e., a 5' moiety). In a particularly preferred embodiment, said 5' moiety is a 5' flap or 
arm comprising a sequence of nucleotides that is not complementary to the target nucleic 
acid strand. 

When an overlapping cleavage structure is formed, it can be recognized and 

5 cleaved by a nuclease that is specific for this structure {i. e. , a nuclease that will cleave 
one or more of the nucleic acids in the overlapping structure based on recognition of this 
structure, rather than on recognition of a nucleotide sequence of any of the nucleic acids 
forming the structure). Such a nuclease may be termed a "structure-specific nuclease". 
In some embodiments, the structure-specific nuclease is a 5' nuclease. In a preferred 

10 embodiment, the structure-specific nuclease is the 5' nuclease of a DNA polymerase. In 
another preferred embodiment, the DNA polymerase having the 5' nuclease is synthesis- 
deficient. In another preferred embodiment, the 5' nuclease is a FEN-1 endonuclease. In 
a particularly preferred embodiment, the 5' nuclease is thermostable. 

In some embodiments, said structure-specific nuclease preferentially cleaves the 

15 downstream oligonucleotide. In a preferred embodiment, the downstream 

oligonucleotide is cleaved one nucleotide into the 5' end of the region that is hybridized 
to the target within the overlapping structure. Cleavage of the overlapping structure at 
any location by a structure-specific nuclease produces one or more released portions or 
fi-agments of nucleic acid, termed "cleavage products." 

20 In some embodiments, cleavage of an overlapping structure is performed under 

conditions wherein one or more of the nucleic acids in the structure can disassociate {i.e. 
un-hybridize, or melt) fi-om the structure. In one embodiment, full or partial 
disassociation of a first cleavage structure allows the target nucleic acid to participate in 
the formation of one or more additional overlapping cleavage structures. In a preferred 

25 embodiment, the furst cleavage structure is partially disassociated. In a particularly 

preferred embodiment only the oligonucleotide that is cleaved disassociates fi-om the first 
cleavage structure, such that it may be replaced by another copy of the same 
oligonucleotide. In some embodiments, said disassociation is induced by an increase in 
temperature, such that one or more ohgonucleotides can no longer hybridize to the target 

30 strand. In other embodiments, said disassociation occurs because cleavage of an 

oligonucleotide produces only cleavage products that cannot bind to the target strand 
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under the conditions of the reaction. In a preferred embodiment, conditions are selected 
wherein an oligonucleotide may associate with (i.e., hybridize to) and disassociate from a 
target strand regardless of cleavage, and wherein the ohgonucleotide may be cleaved 
when it is hybridized to the target as part of an overlapping cleavage structure. In a 
5 particularly preferred embodiment, conditions are selected such that the number of copies 
of the oligonucleotide that can be cleaved when part of an overlapping structure exceeds 
the number of copies of the target nucleic acid strand by a sufficient amount that when 
the first cleavage structure disassociates, the probabiHty that the target strand will 
associate with an intact copy of the oligonucleotide is greater than the probability that 
1 0 that it will associate with a cleaved copy of the oligonucleotide. 
fj In some embodiments, cleavage is performed by a structure-specific nuclease that 

* can recognize and cleave structures that do not have an overlap. In a preferred 

F embodiment, cleavage is performed by a structure-specific nuclease having a lower rate 

5; of cleavage of nucleic acid structures that do not comprise an overlap, compared to the 

^ 1 5 rate of cleavage of structures comprising an overlap. In a particularly preferred 
. embodiment, cleavage is performed by a structure-specific nuclease having less than 1% 

of the rate of cleavage of nucleic acid structures that do not comprise an overlap, 
compared to the rate of cleavage of structures comprising an overlap, 
p In some embodiments it is desirable to detect the cleavage of the overlapping 

^ 20 cleavage structure. Detection may be by analysis of cleavage products or by analysis of 
one or more of the remaining uncleaved nucleic acids. For convenience, the following 
discussion will refer to the analysis of cleavage products, but it will be appreciated by 
those skilled in the art that these methods may as easily be applied to analysis of the 
uncleaved nucleic acids in an invasive cleavage reaction. Any method known in the art 
25 for analysis of nucleic acids, nucleic acid fragments or oligonucleotides may be apphed 
to the detection of cleavage products. 

In one embodiment, the cleavage products may be identified by chemical content, 
e.g., the relative amounts of each atom, each particular type of reactive group or each 
nucleotide base (Chargaff et al, J. Biol Chem. Ill: 405 [1949]) they contain. In this 
30 way, a cleavage product may be distinguished from a longer nucleic acid from which it 
was released by cleavage, or from other nucleic acids. 
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In another embodiment, the cleavage products may be distinguished by a 
particular physical attribute, including but not limited to length, mass, charge, or charge- 
to-mass ratio. In yet another embodiment, the cleavage product may be distinguished by 
a behavior that is related to a physical attribute, including but not limited to rate of 
5 rotation in solution, rate of migration during electrophoresis, coefficient of sedimentation 
in centrifugation, time of flight in MALDI-TOF mass spectrometry, migration rate or 
other behavior in chromatography, mehing temperature from a complementary nucleic 
acid, or precipitabihty from solution. 

Detection of the cleavage products may be through release of a label. Such labels 

32 

1 0 may include, but are not limited to one or more of any of dyes, radiolabels such as P or 
^^S, binding moieties such as biotin, mass tags, such as metal ions or chemical groups, 
charge tags, such as polyamines or charged dyes, haptens such as digoxgenin, 
luminogenic, phosphorescent or fluorogenic moieties, and fluorescent dyes, either alone 
or in combination with moieties that can suppress or shift emission spectra, such as by 

1 5 fluorescence resonance energy transfer (FRET) or coUisional fluorescence energy 
transfer. 

In some embodiments, analysis of cleavage products may include physical 
resolution or separation, for example by electrophoresis, hybridization or by selective 
binding to a support, or by mass spectrometry methods such as MALDI-TOF. In other 

20 embodiments, the analysis may be performed without any physical resolution or 

separation, such as by detection of cleavage-induced changes in fluorescence as in FRET- 
based analysis, or by cleavage-induced changes in the rotation rate of a nucleic acid in 
solution as in fluorescence polarization analysis. 

Cleavage products can be used subsequently in any reaction or read-out method 

25 that can make use of oUgonucleotides. Such reactions include, but are not Umited to, 

modification reactions, such as ligation, taiUng with a template-independent nucleic acid 
polymerase and primer extension with a template-dependent nucleic acid polymerase. 
The modification of the cleavage products may be for purposes including, but not limited 
to, addition of one or more labels or binding moieties, alteration of mass, addition of 

30 specific sequences, or for any other purpose that would facilitate analysis of either the 
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cleavage products or analysis of any other by-product, result or consequence of the 
cleavage reaction. 

Analysis of the cleavage products may involve subsequent steps or reactions that 
do not modify the cleavage products themselves. For example, cleavage products may be 
5 used to complete a functional structure, such as a competent promoter for in vitro 

transcription or another protein binding site. Analysis may include the step of using the 
completed structure for or to perform its function. One or more cleavage products may 
also be used to complete an overlapping cleavage structure, thereby enabhng a 
subsequent cleavage reaction, the products of which may be detected or used by any of 

1 0 the methods described herein, including the participation in further cleavage reactions. 

Certain preferred embodiments of the invasive cleavage reactions are provided in 
the following descriptions. In some embodiments, the methods of the present invention 
employ at least a pair of oligonucleotides that interact with a target nucleic acid to form a 
cleavage structure for a structure-specific nuclease. In some embodiments, the cleavage 

1 5 structure comprises i) a target nucleic acid that may be either single-stranded or double- 
stranded (when a double-stranded target nucleic acid is employed, it may be rendered 
single stranded, e.g., by heating); ii) a first ohgonucleotide, termed the "probe," that 
defines a first region of the target nucleic acid sequence by being the complement of that 
region; iii) a second oligonucleotide, termed the "INVADER oHgonucleotide," the 5' part 

20 of which defines a second region of the same target nucleic acid sequence, adjacent to 
and downstream of the first target region, and the second part of which overlaps into the 
region defined by the first oligonucleotide. 

It can be considered that the binding of these oUgonucleotides in this embodiment 
divides the target nucleic acid into three distinct regions: one region that has 

25 complementarity to only the probe; one region that has complementarity only to the 
INVADER oligonucleotide; and one region that has complementarity to both 
oligonucleotides. As discussed above, in some preferred embodiments of the present 
invention, the overlap may comprise moieties other than overlapping complementary 
bases. Thus, in some embodiments, there is a physical, but not sequence, overlap 

30 between the INVADER and probe oligonucleotides, Le., in these latter embodiments, 
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there is not a region of the target nucleic acid that has complementarity to both 
ohgonucleotides. 

a) Oligonucleotide design 

Design of these ohgonucleotides (i.e., the INVADER oligonucleotide and the 
5 probe) is accomplished using practices that are standard in the art. For example, 

sequences that have self-complementarity, such that the resulting ohgonucleotides would 
either fold upon themselves, or hybridize to each other at the expense of binding to the 
target nucleic acid, are generally avoided. 

One consideration in choosing a length for these ohgonucleotides is the 

10 complexity of the sample containing the target nucleic acid. For example, the human 
genome is approximately 3x10^ basepairs in length. Any 10-nucleotide sequence will 
appear with a frequency of 1 :4^^, or 1 : 1,048,576 in a random string of nucleotides, which 
would be approximately 2,861 times in 3 billion basepairs. Clearly, an oligonucleotide of 
this length would have a poor chance of binding uniquely to a 10-nucleotide region 

15 within a target having a sequence the size of the human genome. If the target sequence 
were within a 3 kb plasmid, however, such an oligonucleotide might have a very 
reasonable chance of binding uniquely. By this same calculation it can be seen that an 
ohgonucleotide of 16 nucleotides (i.e., a 16-mer) is the minimum length of a sequence 
that is mathematically likely to appear once in 3 x 10^ basepairs. This level of specificity 

20 may also be provided by two or more shorter oligonucleotides if they are configured to 
bind in a cooperative fashion (i.e., such that they can produce the intended complex only 
if both or all are bound to their intended target sequences), wherein the combination of 
the short oligonucleotides provides the desired specificity. In one such embodiment, the 
cooperativity between the shorter oligonucleotides is by a coaxial stacking effect that can 

25 occur when the ohgonucleotides hybridize to adjacent sites on a target nucleic acid. In 
another embodiment, the shorter ohgonucleotides are connected to one another, either 
directly, or by one or more spacer regions. The short ohgonucleotides thus connected 
may bind to distal regions of the target and may be used to bridge across regions of 
secondary structure in a target. Examples of such bridging oligonucleotides are described 

30 in PCT Pubhcation WO 98/50403, herein incorporated by reference in its entirety. 
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A second consideration in choosing oligonucleotide length is the temperature 
range in which the oHgonucleotides will be expected to function. A 1 6-mer of average 
base content (50% G-C bases) will have a calculated of about 41^C, depending on, 
among other things, the concentration of the oligonucleotide and its target, the salt 
5 content of the reaction and the precise order of the nucleotides. As a practical matter, 
longer oligonucleotides are usually chosen to enhance the specificity of hybridization. 
Oligonucleotides 20 to 25 nucleotides in length are often used, as they are highly likely to 
be specific if used in reactions conducted at temperatures that are near their TmS (within 
about 5 ''C of the Tm). In addition, with calculated TmS in the range of 50 to VO'^C, such 

10 oHgonucleotides (i.e., 20 to 25-mers) are appropriately used in reactions catalyzed by 
thermostable enzymes, which often display optimal activity near this temperature range. 

The maximum length of the oligonucleotide chosen is also based on the desired 
specificity. One should avoid choosing sequences that are so long that they are either at a 
high risk of binding stably to partial complements, or that they cannot easily be dislodged 

15 when desired (e.g., failure to disassociate from the target once cleavage has occurred or 
failure to disassociate at a reaction temperature suitable for the enzymes and other 
materials in the reaction). 

The first step of design and selection of the oHgonucleotides for the INVADER 
oHgonucleotide-directed cleavage is in accordance with these sample general principles. 

20 Considered as sequence-specific probes individually, each oligonucleotide may be 
selected according to the guidelines listed above. That is to say, each oligonucleotide 
will generally be long enough to be reasonably expected to hybridize only to the intended 
target sequence within a complex sample, usually in the 20 to 40 nucleotide range. 
Ahematively, because the INVADER ohgonucleotide-directed cleavage assay depends 

25 upon the concerted action of these oUgonucleotides, the composite length of the 2 

oligonucleotides which span/bind to the target may be selected to fall within this range, 
with each of the individual oligonucleotides being in approximately the 13 to 17 
nucleotide range. Such a design might be employed if a non-thermostable cleavage 
means were employed in the reaction, requiring the reactions to be conducted at a lower 

30 temperature than that used when thermostable cleavage means are employed. In some 
embodiments, it may be desirable to have these oligonucleotides bind multiple times 
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within a single target nucleic acid {e.g., to bind to multiple variants or multiple similar 
sequences within a target). It is not intended that the method of the present invention be 
limited to any particular size of the probe or INVADER oligonucleotide. 

The second step of designing an oligonucleotide pair for this assay is to choose 

5 the degree to which the upstream "INVADER" oUgonucleotide sequence will overiap 
into the downstream "probe" ohgonucleotide sequence, and consequently, the sizes into 
which the probe will be cleaved. In some preferred embodiments, the probe 
oligonucleotide can be made to "turn over," that is to say probe can be made to depart to 
allow the binding and cleavage of other copies of the probe molecule, without the 

1 0 requirements of thermal denaturation or displacement by polymerization. While in one 
embodiment of this assay probe turnover may be facilitated by an exonucleolytic 
digestion by the cleavage agent, in some preferred embodiments of the present invention 
turnover does not require this exonucleolytic activity. For example, m some 
embodiments, a reaction temperature and reaction conditions are selected so as to create 

15 an equilibrium wherein the probe hybridizes and disassociates from the target. In other 
embodiments, temperature and reaction conditions are selected so that unbound probe can 
initiate binding to the target strand and physically displace bound probe. In still other 
embodiments, temperature and reaction conditions are selected such that either or both 
mechanisms of probe replacement may occur in any proportion. The method of the 

20 present invention is not limited to any particular mechanism of probe replacement. By 
any mechanism, when the probe is bound to the t^get to form a cleavage structure, 
cleavage can occur. The continuous cycling of the probe on and off of the target allows 
multiple probes to bind and be cleaved for each copy of a target nucleic acid. 

i) Non-sequence Overlaps 

25 It has been determined that the relationship between the 3' end of the upstream 

oligonucleotide and the desired site of cleavage on the probe should be carefully 
designed. It is known that the preferred site of cleavage for the types of structure-specific 
endonucleases employed herein is one basepair into a duplex (Lyamichev et al, supra). 
It was previously believed that the presence of an upstream oligonucleotide or primer 

30 allowed the cleavage site to be shifted away from this preferred site, into the single 
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stranded region of the 5' arm (Lyamichev et al, supra and U.S. Patent No. 5,422,253). In 
contrast to this previously proposed mechanism, and while not limiting the present 
invention to any particular mechanism, it is believed that the nucleotide immediately 5', 
or upstream of the cleavage site on the probe (including miniprobe and mid-range probes) 
5 should be able to basepair with the target for efficient cleavage to occur. In the case of 
the present invention, this would be the nucleotide in the probe sequence immediately 
upstream of the mtended cleavage site. In addition, as described herein, it has been 
observed that in order to direct cleavage to that same site in the probe, the upstream 
oUgonucleotide should have its 3' base (i.e., nt) immediately upstream of the intended 

1 0 cleavage site of the probe. In embodiments where the INVADER and probe 

oligonucleotides share a sequence overlap, this places the 3' terminal nucleotide of the 
upstream ohgonucleotide and the base of the probe ohgonucleotide 5' of the cleavage site 
in competition for pairing with the corresponding nucleotide of the target strand. 

To examine the outcome of this competition (i.e. which base is paired during a 

1 5 successful cleavage event), substitutions were made in the probe and INVADER 
oligonucleotides such that either the probe or the INVADER oligonucleotide were 
mismatched with the target sequence at this position. The effects of both arrangements 
on the rates of cleavage were examined. When the INVADER oligonucleotide is 
unpaired at the 3' end, the rate of cleavage was not reduced. If this base was removed, 

20 however, the cleavage site was shifted upstream of the intended site. In contrast, if the 
probe oligonucleotide was not base-paired to the target just upstream of the site to which 
the INVADER oligonucleotide was directing cleavage, the rate of cleavage was 
dramatically reduced, suggesting that when a competition exists, the probe 
oligonucleotide was the molecule to be base-paired in this position. 

25 It appears that the 3' end of the upstream INVADER ohgonucleotide is unpaired 

during cleavage, and yet is important for accurate positioning of the cleavage. To 
examine which part(s) of the 3' terminal nucleotide are required for the positioning of 
cleavage, INVADER oligonucleotides were designed that terminated on this end with 
nucleotides that were aUered in a variety of ways. Sugars examined included 2' 

30 deoxyribose with a 3' phosphate group, a dideoxyribose, 3' deoxyribose, T 0-methyl 
ribose, arabinose and arabinose with a 3' phosphate. Abasic ribose, with and without 3' 
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phosphate were tested. Synthetic "universal" bases such at 3-nitropyrrole and 5-3 
nitroindole on ribose sugars were tested. Finally, a base-like aromatic ring structure, 
acridine, hnked to the 3' end the previous nucleotide without a sugar group was tested. 
The resuhs obtained support the conclusion that the aromatic ring of the base (at the 3' 
5 end of the INVADER oligonuceotide) is an important moiety for accomplishing the 
direction of cleavage to the desired site within the downstream probe. The 3' terminal 
moiety of the INVADER ohgonucleotide need not be a base that is complementary to the 
target nucleic acid. 

ii) Miniprobes And Mid-Range Probes; 

10 As discussed above, the INVADER oligonucleotide-directed cleavage assay may 

be performed using INVADER and probe oUgonucleotides that have a length of about 
13-25 nucleotides (typically 20-25 nucleotides). It is also contemplated that the 
oligonucleotides may themselves be composed of shorter oligonucleotide sequences that 
align along a target strand but that are not covalently linked. This is to say that there is a 

15 nick in the sugar-phosphate backbone of the composite oligonucleotide, but that there is 
no disruption in the progression of base-paired nucleotides in the resulting duplex. When 
short strands of nucleic acid aUgn contiguously along a longer strand the hybridization of 
each is stabihzed by the hybridization of the neighboring fragments because the basepairs 
can stack along the helix as though the backbone was in fact uninterrupted. This 

20 cooperativity of binding can give each segment a stabiHty of interaction in excess of what 
would be expected for the segment hybridizing to the longer nucleic acid alone. One 
appUcation of this observation has been to assemble primers for DNA sequencing, 
typically about 18 nucleotides long, from sets of three hexamer oUgonucleotides that are 
designed to hybridize in this way (Kotler et al Proc. Natl. Acad. Sci. USA 90:4241 

25 [1993]). The resulting doubly-nicked primer can be extended enzymatically in reactions 
performed at temperatures that might be expected to disrupt the hybridization of 
hexamers, but not of 18-mers. 

The use of composite or spUt oligonucleotides is applied with success in the 
INVADER-directed cleavage assay. For example, the probe oligonucleotide may be split 

30 into two oligonucleotides that anneal in a contiguous and adjacent manner along a target 
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oligonucleotide such that the downstream ohgonucleotide (analogous to the probe) is 
assembled from two smaller pieces: a short segment of 6-10 nts (termed the 
"miniprobe"), that is to be cleaved in the course of the detection reaction, and an 
ohgonucleotide that hybridizes immediately downstream of the miniprobe (termed the 

5 "stacker"), that serves to stabilize the hybridization of the probe. To form the cleavage 
structure, an upstream oligonucleotide (the INVADER oligonucleotide) is provided to 
direct the cleavage activity to the desired region of the miniprobe. Assembly of the probe 
from non-linked pieces of nucleic acid (/>., the miniprobe and the stacker) allows regions 
of sequences to be changed without requiring the re-synthesis of the entire proven 

10 sequence, thus improving the cost and flexibiUty of the detection system. In addition, the 
use of unlinked composite oligonucleotides makes the system more stringent in its 
requirement of perfectly matched hybridization to achieve signal generation, allowing 
this to be used as a sensitive means of detecting mutations or changes in the target nucleic 
acid sequences. 

15 In one embodiment, the methods of the present invention employ at least three 

ohgonucleotides that interact with a target nucleic acid to form a cleavage structure for a 
structure-specific nuclease. More specifically, the cleavage structure comprises i) a 
target nucleic acid that may be either single-stranded or double-stranded (when a double- 
stranded target nucleic acid is employed, it may be rendered single-stranded, e.g., by 

20 heating); ii) a first ohgonucleotide, termed the "stacker," that defines a first region of the 
target nucleic acid sequence by being the complement of that region.; iii) a second 
oligonucleotide, termed the "miniprobe," that defines a second region of the target 
nucleic acid sequence by being the complement of that region ; iv) a third 
ohgonucleotide, termed the "INVADER ohgonucleotide," the 5' part of which defines a 

25 third region of the same target nucleic acid sequence, adjacent to and downstream of the 
second target region, and the second or 3' part of which overlaps into the region defined 
by the second ohgonucleotide As described above for embodiments that do not employ a 
stacker, the overlap region can represent a region where there is a physical, but not 
sequence, overlap between the INVADER and probe oligonucleotides. 

30 
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In addition to the benefits cited above, the use of a composite design for the 
oligonucleotides that form the cleavage structure allows more latitude in the design of the 
reaction conditions for performing the INVADER-directed cleavage assay. When a 
longer probe (e.g., 16-25 nt), as described above, is used for detection in reactions that 
5 are performed at temperatures below the Tm of that probe, the cleavage of the probe may 
play a significant role in destabihzing the duplex of which it is a part, thus allowing 
turnover and reuse of the recognition site on the target nucleic acid. In contrast, reaction 
temperatures that are at or above the Tm of the probe mean that the probe molecules are 
hybridizing and releasing from the target quite rapidly even without cleavage of the 
10 probe. When an upstream INVADER oligonucleotide and a cleavage agent are provided 
the probe will be specifically cleaved, but the cleavage will not be necessary to the 
3 turnover of the probe. When a long probe 1 6-25 nt) is used in this way the 

temperatures required to achieve this state is high, around 65 to 70°C for a 25-mer of 
* average base composition. Requiring the use of such elevated temperatures limits the 

h! 1 5 choice of cleavage agents to those that are very thermostable, and may contribute to 
backgroxmd in the reactions, depending of the means of detection, through thermal 
El degradation of the probe ohgonucleotides. With miniprobes, this latter mechanism of 

% I probe replacement may be accomplished at a lower temperature. Thus, shorter probes are 

f J preferred for embodiments using lower reaction temperatures. 

|i 20 The miniprobe of the present invention may vary in size depending on the desired 

application. In one embodiment, the probe may be relatively short compared to a 
standard probe (e.g., 16-25 nt), in the range of 6 to 10 nucleotides. When such a short 
probe is used, reaction conditions can be chosen that prevent hybridization of the 
miniprobe in the absence of the stacker oUgonucleotide. In this way a short probe can be 
25 made to assume the statistical specificity and selectivity of a longer sequence. In the 
event of a perturbation in the cooperative binding of the miniprobe and stacker nucleic 
acids, as might be caused by a mismatch within the short sequence that is otherwise 
complementary to the target nucleic acid or at the junction between the contiguous 
duplexes, this cooperativity can be lost, dramatically reducing the stability of the shorter 
30 duplex (i.e,, that of the miniprobe), and thus reducing the level of cleaved product in the 
assay of the present invention. 
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It is also contemplated that probes of intermediate size may be used. Such probes, 
in the 11 to 15 nucleotide range, may blend some of the features associated with the 
longer probes as originally described, these features including the abiUty to hybridize and 
be cleaved absent the help of a stacker oligonucleotide. At temperatures below the 

5 expected Tm of such probes, the mechanisms of turnover may be as discussed above for 
probes in the 20 nt range, and be dependent on the removal of the sequence in the overiap 
region for destabiUzation and cycling. 

The mid-range probes may also be used at elevated temperatures, at or above their 
expected Tm, to allow melting rather than cleavage to promote probe turnover. In 

10 contrast to the longer probes described above, however, the temperatures required to 
allow the use of such a thermally driven turnover are much lower (about 40 to 60°C), 
thus preserving both the cleavage means and the nucleic acids in the reaction from 
thermal degradation. la this way, the mid-range probes may perform in some instances 
like the miniprobes described above. In a further similarity to the miniprobes, the 

1 5 accumulation of cleavage signal from a mid-range probe may be helped under some 
reaction conditions by the presence of a stacker. 

To summarize, a standard long probe usually does not benefit from the presence 
of a stacker oligonucleotide downstream (the exception being cases where such an 
ohgonucleotide may also disrupt structures in the target nucleic acid that interfere with 

20 the probe binding), and it may be used in conditions requiring several nucleotides to be 
removed to allow the oligonucleotide to release from the target efficiently. If temperature 
of the reaction is used to drive exchange of the probes, standard probes may require use 
of a temperature at which nucleic acids and enzymes are at higher risk of thermal 
degradation. 

25 The miniprobe is very short and performs optimally in the presence of a 

downstream stacker oligonucleotide. The miniprobes are well suited to reactions 
conditions that use the temperature of the reaction to drive rapid exchange of the probes 
on the target regardless of whether any bases have been cleaved. In reactions with 
sufficient amount of the cleavage means, the probes that do bind will be rapidly cleaved 

30 before they melt off. 
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The mid-range or midiprobe combines features of these probes and can be used in 
reactions like those favored by long probes, with longer regions of overlapto drive probe 
turnover at lower temperature. In a preferred embodiment, the midrange probes are used 
at temperatures sufficiently high that the probes are hybridizing to the target and 

5 releasing rapidly regardless of cleavage. The mid-range probe may have enhanced 
performance in the presence of a stacker imder some circumstances. 

The distinctions between the mini-, midi- (i.e., mid-range) and long probes are not 
contemplated to be inflexible and based only on length. The performance of any given 
probe may vary with its specific sequence, the choice of solution conditions, the choice of 

10 temperature and the selected cleavage means. 

The assemblage of oligonucleotides that comprises the cleavage structure of the 
present invention is sensitive to mismatches between the probe and the target. It is also 
contemplated that a mismatch between the INVADER oUgonucleotide and the target may 
be used to distinguish related target sequences. In the 3 -oligonucleotide system, 

1 5 comprising an INVADER, a probe and a stacker oUgonucleotide, it is contemplated that 
mismatches may be located within any of the regions of duplex formed between these 
oUgonucleotides and the target sequence. In a preferred embodiment, a mismatch to be 
detected is located in the probe. In a particularly preferred embodiment, the mismatch is 
in the probe, at the basepair immediately upstream (i.e., 5') of the site that is cleaved 

20 when the probe is not mismatched to the target. 

In another preferred embodiment, a mismatch to be detected is located within the 
region defined by the hybridization of a miniprobe. hi a particularly preferred 
embodiment, the mismatch is in the miniprobe, at the basepair immediately upstream 
(i.e., 5') of the site that is cleaved when the miniprobe is not mismatched to the target. 

25 iii) Software for Oligonucleotide Design for the INVADER assay 

The present invention provides systems and methods for the design of 
oligonucleotides for use in detection assays. In particular, the present invention provides 
systems and methods for the design of oUgonucleotides that successfiiUy hybridize to 
appropriate regions of target nucleic acids (e.g., regions of target nucleic acids that do not 

30 contain secondary structure) under the desired reaction conditions (e.g., temperature, 
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buffer conditions, etc.) for the detection assay. The systems and methods also allow for 
the design of multiple different oligonucleotides (e.g., oligonucleotides that hybridize to 
different portions of a target nucleic acid or that hybridize to two or more different target 
nucleic acids) that all ftmction in the detection assay under the same or substantially the 
same reaction conditions. These systems and methods may also be used to design control 
samples that work under the experimental reaction conditions. 

While the systems and methods of the present invention are not limited to any 
particular detection assay, the following description illustrates the invention when used in 
conjunction with the INVADER assay (Third Wave Technologies, Madison WI; See e.g., 
U.S. Pat. Nos. 5,846,717, 5,985,557, 5,994,069, and 6,001,567 and PCT Publications WO 
97/27214 and WO 98/42873, incorporated herein by reference in their entireties) to detect 
a SNP. One skilled in the art will appreciate that specific and general features of this 
illustrative example are generally applicable to other detection assays, and for use in 
designing INVADER assays for purposes other than SNP detection (e.g., for DNA or 
RNA quantitation, for RNA splice junction detection, etc.). Further, it will be 
appreciated that all algorithms described herein can be appUed as separate software 
elements, or calculations may be performed manually, for the design of any INVADER 
assay probe set without use of the INVADERCREATOR design system described below. 

Oligonucleotide Design for the INVADER assay using the 
INVADERCREATOR program 

In some embodiments where an ohgonucleotide is designed for use in the 
INVADER assay to detect a SNP, the sequence(s) of interest are entered into the 
INVADERCREATOR program (Third Wave Technologies, Madison, WI). As described 
above, sequences may be input for analysis from any number of sources, either directly 
into the computer hosting the INVADERCREATOR program, or via a remote computer 
linked through a communication network (e.g., a LAN, Intranet or Internet network). 
The program designs probes for both the sense and antisense strand. Strand selection is 
generally based upon the ease of synthesis, minimization of secondary structure 
formation, and manufacturability. In some embodiments, the user chooses the strand for 
sequences to be designed for. In other embodiments, the software automatically selects 
the strand. By incorporating thermodynamic parameters for optimum probe cycling and 
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signal generation (Allawi and SantaLucia, Biochemistry, 36:10581 [1997]), 
oligonucleotide probes may be designed to operate at a pre-selected assay temperature 
{e.g., 63°C). Based on these criteria, a final probe set {e.g., primary probes for 2 alleles 
and an INVADER oligonucleotide) is selected. 

5 In some embodiments, the INVADERCREATOR system is a web-based program 

with secure site access that contains a link to BLAST (available at the National Center for 
Biotechnology Information, National Library of Medicine, National Institutes of Health 
web site) and that can be linked to RNAstructure (Mathews et al, RNA 5:1458 [1999]), a 
software program that incorporates mfold (Zuker, Science, 244:48 [1989]). 

1 0 RNAstructure tests the proposed ohgonucleotide designs generated by 

INVADERCREATOR for potential uni- and bimolecular complex formation. 
INVADERCREATOR is open database connectivity (ODBC)-compliant and uses the 
Oracle database for export/integration. The INVADERCREATOR system was 
configured with Oracle to work well with UNIX systems, as most genome centers are 

15 UNIX-based. 

In some embodiments, the INVADERCREATOR analysis is provided on a 
separate server {e.g., a Sun server) so it can handle analysis of large batch jobs. For 
example, a customer can submit up to 2,000 SNP sequences in one email. The server 
passes the batch of sequences on to the INVADERCREATOR software, and, when 

20 initiated, the program designs SNP sets. Li some embodiments, probe set designs are 
returned to the user within 24 hours of receipt of the sequences. 

In some preferred embodiments, each INVADER assay reaction includes at least 
two target sequence-specific, unlabeled oUgonucleotides for the primary reaction: an 
upstream INVADER oligonucleotide and a downstream Probe oligonucleotide. The 

25 INVADER oligonucleotide is generally designed to bind stably at the reaction 

temperature, while the probe is designed to freely associate and disassociate with the 
target strand, with cleavage occurring only when an uncut probe hybridizes adjacent to an 
overlapping INVADER oligonucleotide. In some embodiments, the probe includes a 5' 
flap or "arm" that is not complementary to the target, and this flap is released from the 

30 probe when cleavage occurs. In some embodiments, the released flap participates as an 
INVADER oligonucleotide in a secondary reaction. 
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The following discussion provides one example of how a user interface for an 
INVADERCREATOR program may be configured. 

The user opens a work screen (Figure 42), e.g., by clicking on an icon on a 
desktop display of a computer (e.g., a Windows desktop). The user enters information 
5 related to the target sequence for which an assay is to be designed. In some 

embodiments, the user enters a target sequence. In other embodiments, the user enters a 
code or number that causes retrieval of a sequence jfrom a database. In still other 
embodiments, additional information may be provided, such as the user's name, an 
identifying number associated with a target sequence, and/or an order number. In 
10 preferred embodiments, the user indicates (e.g. via a check box or drop down menu) that 
the target nucleic acid is DNA or RNA. In other preferred embodiments, the user 
indicates the species from which the nucleic acid is derived. In particularly preferred 
m embodiments, the user indicates whether the design is for monoplex (i.e., one target 

i; sequence or allele per reaction) or multiplex (i.e., multiple target sequences or alleles per 

f^l 15 reaction) detection. When the requisite choices and entries are complete, the user starts 
01 the analysis process. In one embodiment, the user clicks a "Go Design It" button to 

^ continue. 

y ^ In some embodiments, the software vahdates the field entries before proceeding. 

J: In some embodiments, the software verifies that any required fields are completed with 

^ 20 the appropriate type of information. In other embodiments, the software verifies that the 
input sequence meets selected requirements (e.g., minimum or maximum length, DNA or 
RNA content). If entries in any field are not found to be vaUd, an error message or dialog 
box may appear. In preferred embodiments, the error message indicates which field is 
incomplete and/or incorrect. Once a sequence entry is verified, the software proceeds 
25 with the assay design. 

In some embodiments, the information supplied in the order entry fields specifies 
what type of design will be created. In preferred embodiments, the target sequence and 
multiplex check box specify which type of design to create. Design options include but 
are not limited to SNP assay, Multiplexed SNP assay (e.g., wherein probe sets for 
30 different alleles are to be combined in a single reaction), Multiple SNP assay (e.g., 
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wherein an input sequence has multiple sites of variation for which probe sets are to be 
designed), and Multiple Probe Arm assays. 

In some embodiments, the INVADERCREATOR software is started via a Web 
Order Entry (WebOE) process (ie., through an Intra/Internet browser interface) and these 
5 parameters are transferred from the WebOE via applet <param> tags, rather than entered 
through menus or check boxes. 

In the case of Multiple SNP Designs, the user chooses two or more designs to 
work with. In some embodiments, this selection opens a new screen view (e.g., a 
Multiple SNP Design Selection view Figure 43). In some embodiments, the software 

10 creates designs for each locus in the target sequence, scoring each, and presents them to 
the user in this screen view. The user can then choose any two designs to work with. In 
some embodiments, the user chooses a first and second design (e.g., via a menu or 
buttons) and clicks a "Go Design It" button to continue. 

To select a probe sequence that will perform optimally at a pre-selected reaction 

15 temperature, the melting temperature (Tm) of the SNP to be detected is calculated using 
the nearest-neighbor model and published parameters for DNA duplex formation (Allawi 
and SantaLucia, Biochemistry, 36:10581 [1997]). In embodiments wherein the target 
strand is RNA, parameters appropriate for RNA/DNA heteroduplex formation may be 
used. Because the assay's salt concentrations are often different than the solution 

20 conditions in which the nearest-neighbor parmieters were obtained (IM NaCl and no 
divalent metals), and because the presence and concentration of the enzyme influence 
optimal reaction temperature, an adjustment should be made to the calculated T^ to 
determine the optimal temperature at which to perform a reaction. One way of 
compensating for these factors is to vary the value provided for the salt concentration 

25 within the melting temperature calculations. This adjustment is termed a 'salt correction'. 
As used herein, the term "salt correction" refers to a variation made in the value provided 
for a salt concentration for the purpose of reflecting the effect on a T^ calculation for a 
nucleic acid duplex of a non-salt parameter or condition affecting said duplex. Variation 
of the values provided for the strand concentrations will also affect the outcome of these 

30 calculations. By using a value of 0.5 M NaCI (SantaLucia, Proc Natl Acad Sci USA, 
95:1460 [1998]) and strand concentrations of about 1 mM of the probe and 1 flVI target. 
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the algorithm used for calculating probe-target melting temperature has been adapted for 
use in predicting optimal INVADER assay reaction temperature. For a set of 30 probes, 
the average deviation between optimal assay temperatures calculated by this method and 
those experimentally determined is about 1,5 °C, 
5 The length of the downstream probe analyte-specific region (ASR) is defined by 

the temperature selected for running the reaction (e.g. , 63°C). Starting from the position 
of the variant nucleotide on the target DNA (the target base that is paired to the probe 
nucleotide 5' of the intended cleavage site), and adding on the 3' end, an iterative 
procedure is used by which the length of the target-binding region of the probe is 

10 increased by one base pair at a time until a calculated optimal reaction temperature (Tm 
plus salt correction to compensate for enzyme effect) matching the desired reaction 
temperature is reached. The non-complementary arm of the probe is preferably selected 
to allow the secondary reaction to cycle at the same reaction temperature. The entire 
probe oligonucleotide is screened using programs such as mfold (Zuker, Science, 244: 48 

15 [1989]) or Oligo 5.0 (Rychlik and Rhoads, Nucleic Acids Res, 17: 8543 [1989]) for the 
possible formation of dimer complexes or secondary structures that could interfere with 
the reaction. The same principles are also followed for INVADER oligonucleotide 
design. Briefly, starting from the position N on the target DNA, the 3* end of the 
INVADER oligonucleotide is designed to have a nucleotide not complementary to either 

20 allele suspected of being contained in the sample to be tested. The mismatch does not 
adversely affect cleavage (Lyamichev et aL, Nature Biotechnology, 17: 292 [1999]), and 
it can enhance probe cycling, presumably by minimizing coaxial stabilization effects 
between the two probes. Additional residues complementary to the target DNA starting 
from residue N-1 are then added in the 5' direction until the stability of the 

25 INVADERoligonucleotide-target hybrid exceeds that of the probe (and therefore the 
planned assay reaction temperature), generally by 15-20 °C. 

In some embodiments, the released cleavage fragment from a primary reaction is 
to be used in a secondary reaction. It is one aspect of the assay design that the all of the 
probe sequences may be selected to allow the primary and secondary reactions to occur at 

30 the same optimal temperature, so that the reaction steps can run simultaneously. In an 
alternative embodiment, the probes may be designed to operate at different optimal 
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temperatures, so that the reaction steps are not simultaneously at their temperature 
optima. 

In some embodiments, the software provides the user an opportunity to change 
various aspects of the design including but not limited to: probe, target and INVADER 
oligonucleotide temperature optima and concentrations; blocking groups; probe arms; 
dyes, capping groups and other adducts; individual bases of the probes and targets (e.g., 
adding or deleting bases from the end of targets and/or probes, or changing internal bases 
in the INVADER and/or probe and/or target oligonucleotides). In some embodiments, 
changes are made by selection from a menu. In other embodiments, changes are entered 
into text or dialog boxes. In preferred embodiments, this option opens a new screen (e.g., 
a Designer Worksheet view, Figure 44). 

In some embodiments, the software provides a scoring system to indicate the 
quality (e.g., the Hkelihood of performance) of the assay designs. In one embodiment, 
the scoring system includes a starting score of points (e.g., 100 points) wherein the 
starting score is indicative of an ideal design, and wherein design features known or 
suspected to have an adverse affect on assay performance are assigned penalty values. 
Penalty values may vary depending on assay parameters other than the sequences, 
including but not Hmited to the type of assay for which the design is intended (e.g., 
monoplex, multiplex) and the temperature at which the assay reaction will be performed. 
The following example provides illustrative scoring criteria for use with some 
embodiments of the INVADER assay based on an intelligence defined by 
experimentation. Examples of design features that may incur score penalties include but 
are not limited to the following [penalty values are indicated in brackets, first number is 
for lower temperature assays (e.g., 62-64 ^C), second is for higher temperature assays 
65-66 X)]: 

1. [100:100] 3' end of INVADER ohgonucleotide resembles the probe arm: 

ARM SEQUENCE: PENALTY AWARDED IF INVADER 

OLIGONUCLEOTIDE ENDS IN: 

Arml: CGCGCCGAGG 5'.,.GAGGX or 5'...GAGGXX 

Arm2: ATGACGTGGCAGAC 5'...CAGACX or 5',..CAGACXX 
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Arm 3: ACGGACGCGGAG 5'...GGAGX or 5'...GGAGXX 

Arm 4: TCCGCGCGTCC 5'...GTCCX or 5'...GTCCXX 

2. [70:70] a probe has 5-base stretch (i.e., 5 of the same base in a row) containing 

the polymorphism; 

5 3. [60:60] a probe has 5-base stretch adjacent to the polymorphism; 

4. [50:50] a probe has 5-base stretch one base from the polymorphism; 

5. [40:40] a probe has 5-base stretch two bases from the polymorphism; 

6. [50:50] probe 5-base stretch is of Gs - additional penalty; 

7. [ 1 00: 1 00] a probe has 6-base stretch anywhere; 

10 8. [90:90] a two or three base sequence repeats at least four times; 

9. [ 1 00: 1 00] a degenerate base occurs in a probe; 

1 0. [60:90] probe hybridizing region is short (13 bases or less for designs 65-67°C; 12 
bases or less for designs 62-64°C) 

1 1 . [40:90] probe hybridizing region is long (29 bases or more for designs 65-67°C, 28 
15 bases or more for designs 62-64°C) 

12. [5:5] probe hybridizing region length - per base additional penalty 

13. [80:80] Ins/Del design with poor discrimination in first 3 bases after probe arm 

14. [100:100] calculated INVADER oligonucleotide Tm within 7.5°C of probe target 
Tm (designs 65-67°C with INVADER oUgonucleotide less than < 70.5°C, designs 62- 

20 64°C with INVADER oligonucleotide < 69.5°C 

1 5 . [20 :20] calculated probes Tms differ by more than 2. 0°C 

16. [100:100] a probe has calculated Tm2°C less than its target Tm 

17. [10:10] target of one strand 8 bases longer than that of other sfrand 

18. [30:30] INVADER oligonucleotide has 6-base stretch anywhere - initial penalty 
25 19. [70:70] INVADER oligonucleotide 6-base stretch is of Gs - additional penalty 

20. [15:15] probe hybridizing region is 14, 15 or 24-28 bases long (65-67°C) or 13,14 
or 26,27 bases long (62-64°C) 

21. [ 1 5 : 1 5] a probe has a 4-base stretch of Gs containing the polymorphism 

30 In particularly preferred embodiments, temperatures for each of the 

oligonucleotides in the designs are recomputed and scores are recomputed as changes are 
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made. In some embodiments, score descriptions can be seen by clicking a "descriptions" 
button. In some embodiments, a BLAST search option is provided. In preferred 
embodiments, a BLAST search is done by cUcking a "BLAST Design" button. In some 
embodiments, this action brings up a dialog box describing the BLAST process. In 

5 preferred embodiments, the BLAST search results are displayed as a highUghted design 
on a Designer Worksheet. 

In some embodiments, a user accepts a design by clicking an "Accept" button. In 
other embodiments, the program approves a design without user intervention. In 
preferred embodiments, the program sends the approved design to a next process step 

1 0 (e.g. , into production; into a file or database). In some embodiments, the program 
provides a screen view (e.g., an Output Page, Figure 45), allowing review of the fmal 
designs created and allowing notes to be attached to the design. In preferred 
embodiments, the user can return to the Designer Worksheet (e.g., by clicking a "Go 
Back" button) or can save the design (e.g., by clicking a "Save It" button) and continue 

1 5 (e.g. , to submit the designed oHgonucleotides for production). 

In some embodiments, the program provides an option to create a screen view of 
a design optimized for printing (e.g , a text-only view) or other export (e.g , an Output 
view, Figure 46). In preferred embodiments, the Output view provides a description of 
the design particularly suitable for printing, or for exporting into another apphcation (e.g, 

20 by copying and pasting into another apphcation). In particularly preferred embodiments, 
the Output view opens in a separate window. 

The present invention is not limited to the use of the INVADERCREATOR 
software. Indeed, a variety of software programs are contemplated and are commercially 
available, including, but not hmited to GCG Wisconsin Package (Genetics computer 

25 Group, Madison, WI) and Vector NTI (Informax, Rockville, Maryland). 

b) Design Of The Reaction Conditions 

Target nucleic acids (e.g., RNA and DNA) that may be analyzed using the 
methods of the present invention that employ a 5' nuclease or other appropriate cleavage 
agents. Such nucleic acids may be obtained using standard molecular biological 
30 techniques. For example, nucleic acids (RNA or DNA) may be isolated from a tissue 
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sample (e.g., a biopsy specimen), tissue culture cells, samples containing bacteria and/or 
virases (including cultures of bacteria and/or viruses), etc. The target nucleic acid may 
also be transcribed in vitro from a DNA template or may be chemically synthesized or 
amplified in by poljnnerase chain reaction. Furthermore, nucleic acids may be isolated 
5 from an organism, either as genomic material or as a plasmid or similar 

extrachromosomal DNA, or they may be a fragment of such material generated by 
treatment with a restriction endonuclease or other cleavage agent, or a shearing force, or 
it may be synthetic. 

Assembly of the target, probe, and ESfVADER oUgonucleotide nucleic acids into 
10 the cleavage reaction of the present invention uses principles commonly used in the 

design of oligonucleotide-based enzymatic assays, such as dideoxynucleotide sequencing 
O and polymerase chain reaction (PCR). As is done in these assays, the oligonucleotides 

are provided in sufficient excess that the rate of hybridization to the target nucleic acid is 
very rapid. These assays are conmionly performed with 50 finoles to 2 pmoles of each 
i1 15 oligonucleotide per microliter of reaction mixture, although they are not necessarily 
S limited to this range. In the Examples described herein, amounts of oligonucleotides 

ranging from 250 frnoles to 5 pmoles per microliter of reaction volume were used. These 
m values were chosen for the purpose of ease in demonstration and are not intended to limit 

the performance of the present invention to these concentrations. Other (e.g., lower) 
0 20 oligonucleotide concentrations commonly used in other molecular biological reactions 
are also contemplated. 

It is desirable that an INVADER oligonucleotide be immediately available to 
direct the cleavage of each probe oligonucleotide that hybridizes to a target nucleic acid. 
In some embodiments described herein, the INVADER oligonucleotide is provided in 
25 excess over the probe oligonucleotide. While this is an effective means of making the 
INVADER oligonucleotide immediately available in such embodiments it is not intended 
that the practice of the present invention be limited to conditions wherein the INVADER 
oligonucleotide is in excess over the probe, or to any particular ratio of INVADER- to- 
probe (e.g., in some preferred embodiments described herein, the probe is provided in 
30 excess over the INVADER oligonucleotide). Another means of assimng the presence of 
an INVADER oligonucleotide whenever a probe binds to a target nucleic acid is to 
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design the INVADER oligonucleotide to hybridize more stably to the target, Le., to have 
a higher Tm than the probe. This can be accomplished by any of the means of increasing 
nucleic acid duplex stability discussed herein (e.g,, by increasing the amount of 
complementarity to the target nucleic acid). 
5 Buffer conditions should be chosen that are compatible with both the 

oligonucleo tide/target hybridization and with the activity of the cleavage agent. The 
optimal buffer conditions for nucleic acid modification enzymes, and particularly DNA 
modification enzymes, generally included enough mono- and di-valent salts to allow 
association of nucleic acid strands by base-pairing. If the method of the present invention 
10 is performed using an enzymatic cleavage agent other than those specifically described 
here, the reactions may generally be performed in any such buffer reported to be optimal 
O for the nuclease function of the cleavage agent. In general, to test the utility of any 

m cleavage agent in this method, test reactions are performed wherein the cleavage agent of 

^fj interest is tested in the MOPS/MnCl2/KCl buffer or Mg-containing buffers described 

1 5 herein and in whatever buffer has been reported to be suitable for use with that agent, in a 
5? manufacturer's data sheet, a journal article, or in personal conLLmunication. 

^ The products of the INVADER oligonucleotide-directed cleavage reaction are 

111 fragments generated by structure-specific cleavage of the input oligonucleotides. The 

resulting cleaved and/or uncleaved ohgonucleotides may be analyzed and resolved by a 
C 20 number of methods including, but not limited to, electrophoresis (on a variety of supports 
including acrylamide or agarose gels, paper, etc.), chromatography, fluorescence 
polarization, mass spectrometry and chip hybridization. In some Examples the invention 
is illustrated using electrophoretic separation for the analysis of the products of fhe 
cleavage reactions. However, it is noted that the resolution of the cleavage products is 
25 not limited to electrophoresis. Electrophoresis is chosen to illustrate the method of the 
invention because electrophoresis is widely practiced in the art and is easily accessible to 
the average practitioner. In other Examples, the invention is illustrated without 
electrophoresis or any other resolution of the cleavage products. 

The probe and INVADER oligonucleotides may contain a label to aid in their 
30 detection following the cleavage reaction. The label may be a radioisotope (e.g., a ^^P or 

35 

S-labelled nucleotide) placed at either the 5' or 3' end of the oHgonucleotide or 
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alternatively, the label may be distributed throughout the oligonucleotide (i.e., a 
uniformly labeled oligonucleotide). The label may be a nonisotopic detectable moiety, 
such as a fluorophore, that can be detected directly, or a reactive group that permits 
specific recognition by a secondary agent. For example, biotinylated oligonucleotides 

5 may be detected by probing with a streptavidin molecule that is coupled to an indicator 
(e.g., alkaline phosphatase or a fluorophore) or a hapten such as dioxigenin may be 
detected using a specific antibody coupled to a similar indicator. The reactive group may 
also be a specific configuration or sequence of nucleotides that can bind or otherwise 
mteract with a secondary agent, such as another nucleic acid, and enzyme, or an antibody. 

10 In some embodiments, a probe is labeled with fluorescing moiety and a quenching 

moiety, wherein cleavage of the cleavage structure separates the fluorescing moiety from 
the quenching moiety, resulting in a detectable signal (e.g., FRET detection). In some 
embodiments, a change in quenching of signal from a donor fluorophor is detected, while 
in other embodiments, a change in emission from an acceptor fluorophore is detected. In 

1 5 still other embodiments, the effect of FRET on both donor and acceptor emissions are 
detected. 

In some embodiments of FRET detection, the fluorescence lifetime of the fluorescence 
emitter is measured {e.g., as in time-resolved fluorescence). While not limiting time- 
resolved fluorescence detection embodiments to any particular labeling systems, 
20 examples of tags that are useful ui time-resolved FRET measurements include europium 
chelate (Eu^^; Biosclair, et al. J. Biomolecular Screening 5(5):319 [2000]), europium 
trisbipyridine cryptate (TBPEu''^; Alpha-Bazin, et al, Anal. Biochem. 286(1): 17 [2000]), 
and ruthenium ligand complex {[Ru(bpy)2(phen-ITC)]^^; Youn, et al. Anal. Biochem. 
232(1):24 [1995]; Lakowicz, et al. Anal. Biochem. 288:62 [2001]). 

25 c) Optimization Of Reaction Conditions 

The DSIVADER oUgonucleotide-directed cleavage reaction is useful to detect the 
presence of specific nucleic acids. In addition to the considerations listed above for the 
selection and design of the INVADER and probe ohgonucleotides, the conditions under 
which the reaction is to be performed may be optimized for detection of a specific target 
30 sequence. 
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One objective in optimizing the INVADER oligonucleotide-directed cleavage 
assay is to allow specific detection of the fewest copies of a target nucleic acid. To 
achieve this end, it is desirable that the combined elements of the reaction interact with 
the maximum efficiency, so that the rate of the reaction (e.g., the number of cleavage 
5 events per minute) is maximized. Elements contributing to the overall efficiency of the 
reaction include the rate of hybridization, the rate of cleavage, and the efficiency of the 
release of the cleaved probe. 

The rate of cleavage will be a function of the cleavage means chosen, and may be 
made optimal according to the manufacturer's instructions when using commercial 
10 preparations of enzymes or as described in the examples herein. The other elements (rate 
of hybridization, efficiency of release) depend upon the execution of the reaction, and 
optimization of these elements is discussed below. 

Three elements of the cleavage reaction that significantly affect the rate of nucleic 
acid hybridization are the concentration of the nucleic acids, the temperature at which the 
1 5 cleavage reaction is performed and the concentration of sahs and/ or other charge- 
shielding ions in the reaction solution. 

The concentrations at which oligonucleotide probes are used in assays of this type 
are well known in the art, and are discussed above. One example of a common approach 
to optimizing an oligonucleotide concentration is to choose a starting amount of 
20 ohgonucleotide for pilot tests; 0.01 to 2 |J,M is a concentration range used in many 
oligonucleotide-based assays. When initial cleavage reactions are performed, the 
following questions may be asked of the data: Is the reaction performed in the absence of 
the target nucleic acid substantially free of the cleavage product?; Is the site of cleavage 
specifically positioned in accordance with the design of the INVADER oligonucleotide?; 
25 Is the specific cleavage product easily detected in the presence of tiie uncleaved probe (or 
is the amount of uncut material overwhelming the chosen visualization method)? 

A negative answer to any of these questions would suggest that the probe 
concentt-ation is too high, and that a set of reactions using serial dilutions of the probe 
should be performed until the appropriate amount is identified. Once identified for a 
30 given target nucleic acid in a give sample type (e.g., purified genomic DNA, body fluid 
extract, lysed bacterial extract), it should not need to be re-optimized. The sample type is 
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important because the complexity of the material present may influence the probe 
concentration optimum. 

Conversely, if the chosen initial probe concentration is too low, the reaction may 
be slow, due to inefficient hybridization. Tests with increasing quantities of the probe 
5 will identify the point at which the concentration exceeds the optimum (e.g,, at which it 
produces an undesirable effect, such as background cleavage not dependent on the target 
sequence, or interference with detection of the cleaved products). Since the hybridization 
will be facihtated by excess of probe, it is desirable, but not required, that the reaction be 
performed using probe concentrations just below this point, 
1 0 The concentration of INVADER oligonucleotide can be chosen based on the 

design considerations discussed above. La some embodiments, the INVADER 
O ohgonucleotide is in excess of the probe oligonucleotide. In a preferred embodiment, the 

gj probe oligonucleotide is in excess of the INVADER ohgonucleotide. 

%1 Temperature is also an important factor in the hybridization of oligonucleotides, 

f • 1 5 The range of temperature tested will depend in large part on the design of the 
m oligonucleotides, as discussed above. Where it is desired to have a reaction be run at a 

L particular temperature (e.g., because of an enzyme requirement, for convenience, for 

III compatibility with assay or detection apparatuses, etc.), the oligonucleotides that function 

^ in the reaction can be designed to optimally perform at the desired reaction temperature, 

p 20 Each INVADER reaction includes at least two target sequence-specific oligonucleotides 
for the primary reaction: an upstream INVADER ohgonucleotide and a downstream 
probe oligonucleotide. In some preferred embodiments, the INVADER oligonucleotide 
is designed to bind stabily at the reaction temperature, while the probe is designed to 
freely associate and disassociate with the target strand, with cleavage occurring only 
25 when an uncut probe hybridizes adjacent to an overlapping INVADER oligonucleotide. 
In preferred embodiments, the probe includes a 5' flap that is not complementary to the 
target, and this flap is released fi-om the probe when cleavage occurs. The released flap 
can be detected directly or indirectly. In some preferred embodiments, as discussed in 
detail below, the released flap participate as in INVADER ohgonucleotide in a secondary 
30 reaction. 
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Optimum conditions for the E^JVADER assay are generally those that allow 
specific detection of the smallest amount of a target nucleic acid. Such conditions may 
be characterized as those that yield the highest target-dependent signal in a given 
timeframe, or for a given amount of target nucleic acid, or that allow the highest rate of 

5 probe cleavage (Le., probes cleaved per minute). 

As noted above, the concentration of the cleavage agent can affect the actual 
optimum temperature for a cleavage reaction. Additionally, different cleavage agents, 
even if used at identical concentrations, can affect reaction temperature optima differently 
(e.g., the difference between the calculated probe Tm and the observed optimal reaction 

1 0 temperature may be greater for one enzyme than for another). Determination of 

appropriate salt corrections for reactions using different enzymes or concentrations of 
enzymes, or for any other variation made in reaction conditions, involves a two step 
process of a) measuring reaction temperature optima under the new reaction conditions, 
and varying the salt concentration within the Tm algorithm to produce a calculated 

1 5 temperature matching or closely approximating the observed optima. Measurement of an 
optimum reaction temperature generally involves performing reactions at a range of 
temperatures selected such that the range allows observation of an increase in 
performance as an optimal temperature is approached (either by increasing or decreasing 
temperatures), and a decrease in performance when an optimal temperature has been 

20 passed, thereby allowing identification of the optimal temperature or temperature range 
(See e.g., Lyamichev, et al., Biochemistry 39: 9523 [2000]). 

In some embodiments, a secondary reaction is used where the released cleavage 
fragment from a primary reaction hybridizes to a synthetic cassette to form a secondary 
cleavage reaction. In some preferred embodiments, the cassette comprises a fluorescing 

25 moiety and a quenching moiety, wherein cleavage of the secondary cleavage structure 
separates the fluorescing moiety from the quenching moiety, resulting in a detectable 
signal (e.g., FRET detection). The secondary reaction can be configured a number of 
different ways. For example, in some embodiments, the synthetic cassette comprises two 
oligonucleotides: an oUgonucleotide that contains the FRET moieties and a 

30 FRET/INVADER oligonucleotide bridging ohgonucleotide that allows the INVADER 
oligonucleotide (i.e., the released flap from the primary reaction) and the FRET 
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oligonucleotide to hybridize thereto, such that a cleavage structure is formed. In some 
embodiments, the synthetic cassette is provided as a single oligonucleotide, comprising a 
hairpin structure (i.e., the FRET oligonucleotide is connected at its 3* end to the bridging 
oUgonucleotide by a loop). The loop may be nucleic acid, or a non-nucleic acid spacer or 

5 Unker. The linked molecules may together be described as a FRET cassette. In the 
secondary reaction using a FRET cassette the released flap from the primary reaction, 
which acts as an INVADER oligonucleotide, should be able to associate and disassociate 
with the FRET cassette freely, so that one released flap can direct the cleavage of 
multiple FRET cassettes. It is one aspect of the assay design that all of the probe 

1 0 sequences may be selected to allow the primary and secondary reactions to occur at the 
same optimal temperature, so that the reaction steps can run simultaneously. In an 
alternative embodiment, the probes may be designed to operate at different optimal 
temperatures, so that the reaction steps are not simultaneously at their temperature 
optima. As noted above, the same iterative process used to select the ASR of the probe 

1 5 can be used in the design of the portion of the primary probe that participates in a 
secondary reaction. 

Another determinant of hybridization efficiency is the salt concentration of the 
reaction. In large part, the choice of solution conditions will depend on the requirements 
of the cleavage agent, and for reagents obtained commercially, the manufacturer's 

20 instructions are a resource for this information. When developing an assay utilizing any 
particular cleavage agent, the oligonucleotide and temperature optimizations described 
above should be performed in the buffer conditions best suited to that cleavage agent. 

In some embodiments, additional agents may be included in reaction mixtures to 
enhance assay performance. For example, charged compounds such as aminoglycosides 

25 and other polyamines have been used to modulate DNA and RNA conformation and 
function (see, e.g., Eamshaw and Gait, Nucl. Acids Res. 26:5551 [1998]; Robinson and 
Wang, Nucl Acids Res, 24:676 [1996]; Jerinie, J. Mol. Biol 304(5):707 [2000]; 
Schroeder et aL, EMBO 19(1): 1 [2000]). Inclusion of the aminoglycoside antibiotic 
neomycin sulfate (e,g., at 1 |M in a primary reaction) can enhance assay performance by, 

30 e.g., reducing background signal, and therefore reducing the limit of detection of a 
particular INVADER assay probe set. Compounds of this type that may find use in 
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INVADER assay reactions include, but are not limited to, aminoglycosides, oligomerized 
aminoglycosides, and aminoglycoside bioconj agates, and other polyanions including, but 
not limited to, spermine and hexaamine cobalt. 

A "no enzyme" control allows the assessment of the stability of the labeled 
5 ohgonucleotides under particular reaction conditions, or in the presence of the sample to 
be tested e,g., in assessing the sample for contaminating nucleases). In this manner, the 
substrate and ohgonucleotides are placed in a tube containing all reaction components, 
except the enzyme and treated the same as the enzyme-containing reactions. Other 
controls may also be included. For example, a reaction with all of the components except 
10 the target nucleic acid will serve to confirm the dependence of the cleavage on the 
presence of the target sequence. 

d) Selection of a Cleavage Agent 

As demonstrated in a number of the Examples, some 5' nucleases do not require 
an upstream oligonucleotide to be active in a cleavage reaction. Although cleavage may 

15 be slower without the upstream oligonucleotide, it may still occur (Lyamichev et aL, 

Science 260:778 [1993], Kaiser et al, J, Biol. Chem., 274:21387 [1999]). When a DNA 
strand is the template or target strand to which probe oligonucleotides are hybridized, the 
5' nucleases derived from DNA polymerases and some flap endonucleases (FENs), such 
as that from Methanococcus jannaschii, can cleave quite well without an upstream 

20 oligonucleotide providing an overlap (Lyamichev et al. Science 260:778 [1993], Kaiser 
et al, J. Biol Chem., 274:21387 [1999], and US Patent No. 5,843,669, herein 
incorporated by reference in its entirety). These nucleases may be selected for use in 
some embodiments of the INVADER assay, e.g., in embodiments wherein cleavage of 
the probe in the absence of an INVADER oligonucleotide gives a different cleavage 

25 product, which does not interfere with the intended analysis, or wherein both types of 
cleavage, INVADER oligonucleotide-directed and INVADER oligonucleotide- 
independent, are intended to occur. 

In other embodiments it is preferred that cleavage of the probe be dependent on 
the presence of an upstream INVADER oligonucleotide, and enzyme having this 

30 requirement would be used. Other FENs, such as those from Archeaoglobus fulgidus 
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(Afu) and Pyrococcus furiosus (Pfu), cleave an overlapped structure on a DNA target at 
so much greater a rate than they do a non-overlapping structure {i.e., either missing the 
upstream oligonucleotide or having a non-overlapping upstream oligonucleotide) that 
they can be viewed as having an essentially absolute requirement for the overlap 

5 (Lyamichev et al., Nat. BiotechnoL, 1 7:292 [1999], Kaiser et al., J. Biol. Chem., 

274:21387 [1999]). When an RNA target is hybridized to DNA oligonucleotide probes 
to form a cleavage structure, many FENs cleave the downstream DNA probe poorly, 
regardless of the presence of an overlap. On such an RNA-containing structure, the 5' 
nucleases derived from DNA polymerases have a strong requirement for the overlap, and 

1 0 are essentially inactive in its absence. The selection of enymes for use in the detection of 
RNA targets is discussed in more detail below, in Section IV: Improved Enzymes For 
Use In INVADER Oligonucleotide-Directed Cleavage Reactions Comprising RNA 
Targets. 

e) Probing For Multiple Alleles 

1 5 The INVADER oligonucleotide-directed cleavage reaction is also useful in the 

detection and quantification of individual variants or alleles in a mixed sample 
population. By way of example, such a need exists in the analysis of tumor material for 
mutations in genes associated with cancers. Biopsy material from a tumor can have a 
significant complement of normal cells, so it is desirable to detect mutations even when 

20 present in fewer than 5% of the copies of the target nucleic acid in a sample. In this case, 
it is also desirable to measure what fraction of the population carries the mutation. 
Similar analyses may also be done to examine allelic variation in other gene systems, and 
it is not intended that the method of the present invention by limited to the analysis of 
tumors. 

25 

As demonstrated below, in one embodiment, reactions can be performed under 
conditions that prevent the cleavage of probes bearing even a single-nucleotide difference 
mismatch, but that permit cleavage of a similar probe that is completely complementary 
to the target in this region. In a preferred embodiment, a mismatch is positioned at the 
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nucleotide in the probe that is 5' of the site where cleavage occurs in the absence of the 
mismatch. 

In other embodiments, the INVADER assay may be performed under conditions 
that have a tight requirement for an overlap (e.g., using the Afu FEN for DNA target 

5 detection or the 5' nuclease of DNA polymerase for RNA target detection, as described 
above), providing an alternative means of detecting single nucleotide or other sequence 
variations. In one embodiment, the probe is selected such that the target base suspected 
of varying is positioned at the 5' end of the target-complementary region of this probe. 
The upstream INVADER oligonucleotide is positioned to provide a single base of 

1 0 overlap. If the target and the probe oligonucleotide are complementary at the base in 
question, the overlap forms and cleavage can occur. However, if the target does not 
complement the probe at this position, that base in the probe becomes part of a non- 
complementary 5' arm, no overlap between the INVADER ohgonucleotide and probe 
ohgonucleotide exists, and cleavage is suppressed. 

15 It is also contemplated that different sequences may be detected in a single 

reaction. Probes specific for the different sequences may be differently labeled. For 
example, the probes may have different dyes or other detectable moieties, different 
lengths, or they may have differences in net charges of the products after cleavage. When 
differently labeled in one of these ways, the contribution of each specific target sequence 

20 to final product can be tallied. This has application in detecting the quantities of different 
versions of a gene within a mixture. Different genes in a mixture to be detected and 
quantified may be wild type and mutant genes {e.g., as may be found in a tumor sample, 
such as a biopsy). In this embodiment, one might design the probes to precisely the same 
site, but one to match the wild-type sequence and one to match the mutant. Quantitative 

25 detection of the products of cleavage from a reaction performed for a set amount of time 
will reveal the ratio of the two genes in the mixture. Such analysis may also be 
performed on unrelated genes in a mixture. This type of analysis is not intended to be 
limited to two genes. Many variants within a mixture may be similarly measured. 

Alternatively, different sites on a single gene may be monitored and quantified to 

30 verify the measurement of that gene. In this embodiment, the signal from each probe 
would be expected to be the same. 
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It is also contemplated that multiple probes may be used that are not differently 
labeled, such that the aggregate signal is measured. This may be desirable when using 
many probes designed to detect a single gene to boost the signal from that gene. This 
configuration may also be used for detecting unrelated sequences within a mix. For 

5 example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
is present, different signals on the probes would not be required in such an application of 
the present invention, and may actually be undesirable for reasons of confidentiality. 

Just as described for the two-oligonucleotide system, above, the specificity of the 

1 0 detection reaction will be influenced by the aggregate length of the target nucleic acid 
sequences involved in the hybridization of the complete set of the detection 
oligonucleotides. For example, there may be applications in which it is desirable to 
detect a single region within a complex genome. In such a case the set of 
oligonucleotides may be chosen to require accurate recognition by hybridization of a 

1 5 longer segment of a target nucleic acid, often in the range of 20 to 40 nucleotides. In 
other instances it may be desirable to have the set of oligonucleotides interact with 
multiple sites within a target sample. In these cases one approach would be to use a set 
of oligonucleotides that recognize a smaller, and thus statistically more common, 
segment of target nucleic acid sequence. 

20 In one preferred embodiment, the INVADER and stacker oUgonucleotides may be 

designed to be maximally stable, so that they will remain bound to the target sequence for 
extended periods during the reaction. This may be accomplished through any one of a 
number of measures well known to those skilled in the art, such as adding extra 
hybridizing sequences to the length of the oligonucleotide (up to about 50 nts in total 

25 length), or by using residues with reduced negative charge, such as phosphorothioates or 
peptide-nucleic acid residues, so that the complementary strands do not repel each other 
to degree that natural strands do. Such modifications may also serve to make these 
flanking oligonucleotides resistant to contaminating nucleases, thus further ensuring their 
continued presence on the target strand during the course of the reaction. In addition, the 

30 INVADER and stacker oligonucleotides may be covalently attached to the target (eg., 
through the use of psoralen cross-linking). 
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f) Applications for pooled DNA and RNA samples 

In some embodiments, the present invention provides methods and kits for 
assaying a pooled sample using INVADER detection reagents (e.g. primary probe, 
INVADER probe, and FRET cassette). In some preferred embodiments, the kit 
5 comprises instructions on how to perform the INVADER assay and specifically how to 
apply the INVADER detection assay to pooled samples from many individuals, or to 
"pooled" samples from many cells (e.g. from a biopsy sample) from a single subject. 

In particular embodiments, the present invention allows detection of 
polymorphims in pooled samples combined from many individuals in a population (e.g. 

10 10, 50, 100, or 500 individuals), or from a single subject where the nucleic acid 

sequences are from a large number of cells that are assayed at once. In this regard, the 
present invention allows the frequency of rare mutations in pooled samples to be detected 
and an allele frequency for the population established. In some embodiments, this allele 
frequency may then be used to statistically analyze the results of applying the EWADER 

15 detection assay to an individual's frequency for the polymorphism (e.g. determined using 
the INVADER assay). In this regard, mutations that rely on a percent of mutants found 
(e.g. loss of heterozygozity mutations) may be analyzed, and the severity of disease or 
progression of a disease determined (See, e.g. US Patent 6,146,828 and 6,203,993 to 
Lapidus, hereby incorporated by reference for all purposes, where genetic testing and 

20 statistical analysis are employed to find disease causing mutations or identify a patient 
sample as containing a disease causing mutations). 

In some embodiments of the present invention, broad population screens are 
performed. In some preferred embodiments, pooling DNA from several hundred or a 
thousand individuals is optimal. In such a pool, for example, DNA from any one 

25 individual would not be detectable, and any detectable signal would provide a measure of 
frequency of the detected allele in a broader population. The amount of DNA to be used, 
for example, would be set not by the number of individuals in a pool, but rather by the 
allele frequency to be detected. For example, in some embodiments, an assay gives 
ample signal from 20 to 40 ng of DNA in a 90 minute reaction. At this level of 

30 sensitivity, analysis of 1 \xg of DNA from a high-complexity pool would produce 
comparable signal from alleles present in only about 3-5% of the population. 
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g) Applications of RNA detection, 

RNA quantitation is becoming increasingly important in basic, pharmaceutical, 
and clinical research. For example, quantitation of viral RNAs can predict disease 
5 progression and therapeutic efficacy. Likewise, gene expression analysis of diseased vs, 
normal, or untreated vs. treated, tissue can identify relevant biological responses or assess 
the effects of pharmacological agents. As the focus of the Human Genome Project 
moves toward gene expression analysis, the field will require a flexible RNA analysis 
technology that can quantitatively monitor multiple forms of ahematively transcribed 
1 0 and/or processed RNAs. 

As decribed above for the detection of multiple alleles, multiplex formats of the 
RNA INVADER assay enable simultaneous expression analysis of two or more genes 
within the same sample. In a primary reaction, one-nucleotide overlap-substrates are 
generated by the hybridization of INVADER oligonucleotides and probe oligonucleotides 
15 to their respective RNA targets. Each probe contains a specific, target-complementary 
region and a distinctive non-complementary 5' flap that is associated only with that 
specific mRNA in that assay. The distinctive flaps may be distinguished in any of the 
myriad ways disclosed herein (e.g., with different labels, different secondary cleavage 
systems having different labels, specific antibodies, different sizes when resolved, 
20 differenct sequences detected by hybridization in solution or on surfaces, etc.) 

While the RNA invasive cleavage assay, like the method used for DNA detection 
described above, can use two invasive cleavage reactions in sequence (described below, 
in Section II of the Detailed Description of the Invention), its preference for the 5' 
nucleases derived from DNA polymerases (described in detail in Section IV of the 
25 Detailed Description of the Invention) indicates that additional format changes are 
preferred. Unlike the FEN 5' nucleases generally used for detection of DNA targets, 
optimal signal amplification with the DNA Pol-related 5' nucleases occurs only when a 
probe turnover mechanism is employed in both the primary and secondary reactions (in 
contrast to an INVADER ohgonucleotide turnover mechanism, wherein an INVADER 
30 oligonucleotide cycles, e.g. , to direct the cleavage of multiple FRET cassettes, as 
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described below, in Section II of this Detailed Description). Consequently, in preferred 
embodiments, RNA detection uses sequential operation of the two reactions, rather than 
simultaneous reaction performance. Because the reactions are performed truly 
sequentially, in these embodiments, the RNA INVADER assay signal accumulates 
5 lineariy in both a target- and time-dependent manner. In contrast, the primary and 

secondary reactions of the DNA INVADER assay, when run concurrently, amplify signal 
as a linear function of target level, but as a quadratic function of time. In the sequential 
embodiments, the RNA INVADER assay uses two separate oligonucleotides, a secondary 
probe {e.g., a FRET probe) and secondary target, for signal generation. 
10 A key feature of the RNA invasive cleavage assay is its ability to discriminate 

highly homologous RNA sequences, such as those found in cytochrome P450 gene 
families. Like the DNA INVADER assay, the RNA INVADER assay can discriminate 
single-base changes. In some embodiments, the first 5' complementary base of each 
probe is positioned at a non-conserved site in its mRNA target, so that a mismatch 
1 5 prevents formation of the overlap-structure, and thus prevents cleavage of the probe. 
Alternatively spliced mRNA variants can be specifically detected by positioning the 
cleavage site at a splice junction. 

To monitor large changes in mRNA levels, the dynamic range of the assay can be 
extended using real-time analysis. However, since the assay generates signal linearly 
20 with time or target level, simply varying the amount of sample added per reaction and 
calculating the copies of mRNA per ng total RNA enables accurate quantitation with a 
single endpoint measurement on low-cost instrumentation. Further, in cases where 
absolute quantitation is not necessary, the assay's linear signal amplification mechanism 
and reproducibility also eliminate the need for a standard curve and enable simple and 
25 precise relative quantitation of any one gene. 

The RNA INVADER assay is particulariy suited for detecting alternatively 
spliced or edited RNA variants because even a single base change at the overiap site 
affects 5' nuclease cleavage. All areas requiring RNA quantitation, such as high- 
throughput screening in drug discovery research, monitoring drug metabolism and safety 
30 in clinical trials, and clinical load monitoring of viral RNA can use this technology. 
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splice variants can be monitored in at least two ways with the assay: 1) detection of an 
individual exon or 2) detection of a specific splice junction. 

To examine an RNA population for variants having more or fewer exons after 
splicing, INVADER assay probe sets are designed for each of the exons of interest (or for 
all exons in the mature RNA). Quantitation of exons, independent of how many mRNAs 
they reside in, may provide information about the number of splice variants for a given 
gene, as well as indicate the levels of expression for each exon. Mini in vitro transcripts 
containing only one or a few exons can be generated for each probe set so that absolute 
quantitation can be performed for each exon, thus enabUng accurate comparisons of exon 
levels. If it is known that a particular exon is present in all known variants, in some 
embodiments, a probe set is designed for that exon for use as an internal control to 
normalize across different samples. RNAs having a one copy of each exon (e.g., 
"normally spliced" RNA) should produce signal from the collection of probe sets in 
certain relative amounts (which should be esseintially equal for all exons, corrected for 
variations in the sensitivity of individual probe sets; see Section V). Alterations in 
splicing alter the relative amounts of the exons. For example, if all of the produced 
RNAs are missing one of the normal exons, the signal for that exon drops toward zero, 
while if half of the RNAs are missing that exon, the signal for that exon drops toward 
50%. More complex combinations of splice variations and mixtures of differently 
spliced mRNAs yield more complex and more informative profiles. Detection is not 
limited to exons. RNA populations may also be monitored for the presence of intron 
sequences that are usually removed by sphcing. Such global exon screening provides 
biologically relevant or diagnostic information when comparing normal vs. diseased 
tissue or untreated vs. treated cells (e.g., in pharmacogenomic screening assays). An 
array-based description of this type of measurement is referred to as alternative sphcing 
detector arrays (D. Black, Cell 103:367 [2000]). It is contemplated that the mRNA 
INVADER assay gives similar results but with greater specificity and more accurate 
quantitation than the ohgonucleotide array formats. 

In an alternative embodiment, alternatively spliced mRNAs is detected by 
examiniation of specific spUce junctions. The advantage of monitoring the splice sites, as 
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opposed to the exons themselves, is that even splice variants involving very small exons 
(e.g. < 10 nts) are accurately detected with the assay. 

Additionally, in some embodiments, the mRNA INVADER assay isalso used to 
monitor altemative start and stop sites in the mRNA, and is used to monitor lifetimes of 
5 processed and unprocessed RNAs and RNA fragments (e.g. , as used in timecourse 
studies following induction). 

IL Signal Enhancement By Incorporating The Products Of An Invasive 
Cleavage Reaction Into A Subsequent Invasive Cleavage Reaction 

10 

As noted above, the oUgonucleotide product released by the invasive cleavage can 
be used subsequently in any reaction or read-out method that uses oUgonucleotides in the 
size range of a cleavage product. In addition to the reactions involving primer extension 
and transcription, described herein, another enzymatic reaction that makes use of 

15 oligonucleotides is the invasive cleavage reaction. The present invention provide means 
of using the oligonucleotide released in a primary invasive cleavage reaction as a 
component to complete a cleavage structure to enable a secondary invasive cleavage 
reaction. ITis not intended that the sequential use of the invasive cleavage product be 
hmited to a single additional step. It is contemplated that many distinct invasive cleavage 

20 reactions may be performed in sequence. 

The polymerase chain reaction uses a DNA replication method to create copies of 
a targeted segment of nucleic acid at a logarithmic rate of accumulation. This is made 
possible by the fact that when the strands of DNA are separated, each individual strand 
contains sufficient information to allow assembly of a new complementary strand. When 

25 the new strands are synthesized the number of identical molecules has doubled. Within 
20 iterations of this process, the original may be copied 1 million-fold, making very rare 
sequences easily detectable. The mathematical power of a doubling reaction has been 
incorporated into a number of amplification assays. 

By performing multiple, sequential invasive cleavage reactions the method of the 

30 present invention captures an exponential mathematical advantage without producing 
additional copies of the target analyte. In a simple invasive cleavage reaction the yield. 
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Y, is simply the turnover rate, K, multiplied by the time of the reaction, t (Le., Y =^ 
(K)(t)). If Y is used to represent the yield of a simple reaction, then the yield of a 
compound {i.e., a multiple, sequential reaction), assuming that each of the individual 
invasive cleavage steps has the same turnover rate, can be simply represented as Y"", 
where n is the number of invasive cleavage reactions that have been performed in the 
series. If the yields of each step differ the ultimate yield can be represented as the 
product of the multiplication of the yields of each individual reaction in the series. For 
example, if a primary invasive cleavage reaction can produce one thousand products in 
30 minutes, and each of those products can in turn participate in 1000 additional 
reactions, there will be 1000^ copies (1000 x 1000) of the ultimate product in a second 
reaction. If a third reaction is added to the series, then the theoretical yield will be 1000^ 
(1000 X 1000 X 1000), In the methods of the present invention the exponent comes from 
the number of invasive cleavage reactions in the cascade. This can be contrasted to the 
amphfication methods described above (e,g., PGR) in which Y is limited to 2 by the 
number of strands in duplex DNA, and the exponent n is the number of cycles performed, 
so that many iterations are necessary to accumulate large amounts of product. 

To distinguish the exponential amplifications described above from those of the 
present invention, the former can be considered reciprocating reactions because the 
products the reaction feed back into the same reaction (e.g,, event one leads to some 
number of events 2, and each event 2 leads back to some number of events 1). In 
contrast, the events in some embodiments of the present invention are sequential (e.g., 
event 1 leads to some number of events 2; each event 2 leads to some number of events 3, 
etc., and no event can contribute to an event earlier in the chain). 

The sensitivity of the reciprocating methods is also one of the greatest weaknesses 
when these assays are used to determine if a target nucleic acid sequence is present or 
absent in a sample. Because the product of these reactions is detectable copies of the 
starting material, contamination of a new reaction with the products of an earlier reaction 
can lead to false positive results, (i.e., the apparent detection of the target nucleic acid in 
samples that do not actually contain any of that target analyte). Furthermore, because the 
concentration of the product in each positive reaction is so high, amounts of DNA 
sufficient to create a strong false positive signal can be communicated to new reactions 
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very easily either by contact with contaminated instruments or by aerosol. In contrast to 
the reciprocating methods, the most concentrated product of the sequential reaction (Le., 
the product released in the ultimate invasive cleavage event) is not capable of initiating a 
like reaction or cascade if carried over to a fresh test sample. This is a marked advantage 
over the exponential amplification methods described above because the reactions of the 
present invention may be performed without the costly containment arrangements (e,g., 
either by specialized instruments or by separate laboratory space) required by any 
reciprocating reaction. While the products of a penultimate event may be inadvertently 
transferred to produce a background of the ultimate product in the absence of the a target 
analyte, the contamination would need to be of much greater volume to give an 
equivalent risk of a false positive result. 

When the term sequential is used it is not intended to limit the invention to 
configurations in which that one invasive cleavage reaction or assay must be completed 
before the initiation of a subsequent reaction for invasive cleavage of a different probe. 
Rather, the term refers to the order of events as would occur if only single copies of each 
of the oligonucleotide species were used in an assay. The primary invasive cleavage 
reaction refers to that which occurs first, in response to the formation of the cleavage 
structure on the target nucleic acid. Subsequent reactions may be referred to as 
secondary, tertiary and so forth, and may involve artificial "target" strands that serve only 
to support assembly of a cleavage structure, and which are unrelated to the nucleic acid 
analyte of interest. While the complete assay may, if desired, be configured with each 
step of invasive cleavage separated either in space (e.g., in different reaction vessels) or 
in time (e.g., using a shift in reaction conditions, such as temperature, enzyme identity or 
solution condition, to enable the later cleavage events), it is also contemplated that all of 
the reaction components may be mixed so that secondary reactions may be initiated as 
soon as product fi*om a primary cleavage becomes available. In such a format, primary, 
secondary and subsequent cleavage events involving different copies of the cleavage 
structures may take place simultaneously. 

Several levels of this sort of linear amplification can be envisioned, in which each 
successive round of cleavage produces an oligonucleotide that can participate in the 
cleavage of a different probe in subsequent rounds. The primary reaction would be 
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specific for the analyte of interest with secondary (and tertiary, etc.) reactions being used 
to generate signal while still being dependent on the primary reaction for initiation. 

The released product may perform in several capacities in the subsequent 
reactions. For example, the product of one invasive cleavage reaction becomes the 

5 INVADER oligonucleotide to direct the specific cleavage of another probe in a second 
reaction. In such an example, the first invasive cleavage structure is formed by the 
anneahng of the INVADER oligonucleotide and the probe oligonucleotide (Probe 1) to 
the first target nucleic acid (Target 1). The target nucleic acid is divided into three 
regions based upon which portions of the INVADER and probe oligonucleotides are 

10 capable of hybridizing to the target. Region 1 of the target has complementarity to only 
the INVADER oUgonucleotide; region 3 of the target has complementarity to only the 
probe; and region 2 of the target has an overlap between the INVADER and probe 
oligonucleotides. 

Cleavage of Probe 1 releases the '*Cut Probe 1". The released Probe 1 is then 

15 used as the INVADER oligonucleotide in second cleavage. The second cleavage 

structure is formed by the annealing of the Cut Probe 1, a second probe ohgonucleotide 
("Probe 2") and a second target nucleic acid ("Target 2"). In some embodiments, Probe 2 
and the second target nucleic acid are covalently connected, preferably at their 3' and 5* 
ends, respectively, thus forming a hairpin stem and loop, termed herein a "cassette". The 

20 loop may be nucleic acidor a non-nucleic acid spacer or linker. Inclusion of an excess of 
the cassette molecule allows each Cut Probe 1 to serve as an INVADER to direct the 
cleavage of multiple copies of the cassette. 

Probe 2 may be labeled and detection of cleavage of the second cleavage structure 
may be accomplished by detecting the labeled cut Probe 2; the label may a radioisotope 

25 (e.g,, ^^P, ^^S), a fluorophore (e.g., fluorescein), a reactive group capable of detection by a 
secondary agent (eg., biotin/streptavidin), a positively charged adduct which permits 
detection by selective charge reversal (as discussed in Section IV above), etc. 
Alternatively, the cut Probe 2 may used in a tailing reaction, or to complete or activate a 
protein-binding site, or may be detected or used by any of the means for detecting or 

30 using an oligonucleotide described herein. 
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In other embodiments, probe oligonucleotides that are cleaved in the primary 
reaction can be designed to fold back on themselves {i.e., they contain a region of self- 
complementarity) to create a molecule that can serve as both the INVADER and target 
oligonucleotide (termed here an "IT" complex). The IT complex then enables cleavage 

5 of a different probe present in the secondary reaction. Inclusion of an excess of the 
secondary probe molecule ("Probe 2"), allows each IT molecule to serve as the platform 
for the generation of multiple copies of cleaved secondary probe. The target nucleic acid 
is divided into three regions based upon which portions of the INVADER and probe 
oligonucleotides are capable of hybridizing to the target (as discussed above and it is 

10 noted that the target may be divided into four regions if a stacker oUgonucleotide is 
employed). The second cleavage structure is formed by the annealing of the second 
probe ("Probe 2") to the fragment of Probe 1 ("Cut Probe 1") that was released by 
cleavage of the first cleavage structure. The Cut Probe 1 forms a hairpin or stem/loop 
structure near its 3' terminus by virtue of the anneaUng of the regions of self- 

1 5 complementarity contained within Cut Probe 1 (this self-annealed Cut Probe 1 forms the 
IT complex). The IT complex (Cut Probe 1) is divided into three regions. Region 1 of 
the IT complex has complementarity to the 3' portion of Probe 2; region 2 has 
complementarity to both the 3' end of Cut Probe 1 and to the 5' portion of Probe 2; and 
region 3 contains the region of self-complementarity (i.e., region 3 is complementary to 

20 the 3' portion of the Cut Probe 1). Note that with regard to the IT complex (i.e., Cut 
Probe 1), region 1 is located upstream of region 2 and region 2 is located upstream of 
region 3. As for other embodiments of invasive cleavage, region "2" can represent a 
region where there is a physical, but not sequence, overlap between the INVADER 
ohgonucleotide portion of the Cut Probe 1 and the Probe 2 oligonucleotide. 

25 The cleavage products of the secondary invasive cleavage reaction (i.e.. Cut Probe 

2) can either be detected, or can in turn be designed to constitute yet another integrated 
INVADER-target complex to be used with a third probe molecule, again unrelated to the 
preceding targets. 

It is envisioned that the oligonucleotide product of a primary cleavage reaction 
30 may fill the role of any of the oligonucleotides described herein (e.g., it may serve as a 
target strand without an attached INVADER oligonucleotide-hke sequence, or it may 
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serve as a stacker oligonucleotide, as described above), to enhance the turnover rate seen 
in the secondary reaction by stabilizing the probe hybridization through coaxial stacking. 

Secondary cleavage reactions in some preferred embodiments of the present 
invention include the use of FRET cassettes. Such molecules provide both a secondary 
target and a FRET labeled cleavable sequence, allov^ing homogeneous detection (i.e., 
without product separation or other manipulation after the reaction) of the sequential 
invasive cleavage reaction. Other preferred embodiments use a secondary reaction 
system in which the FRET probe and synthetic target are provided as separate 
oligonucleotides. 

In a preferred embodiment, each subsequent reaction is facilitated by (/.e., is 
dependent upon) the product of the previous cleavage, so that the presence of the ultimate 
product may serve as an indicator of the presence of the target analyte. However, 
cleavage in the second reaction need not be dependent upon the presence of the product 
of the primary cleavage reaction; the product of the primary cleavage reaction may 
merely measurably enhance the rate of the second cleavage reaction. 

In summary, the INVADER assay cascade {i.e., sequential invasive cleavage 
reactions) of the present invention is a combination of two or more linear assays that 
allows the accumulation of the ultimate product at an exponential rate, but without 
significant risk of carryover contamination. It is important to note that background that 
does not arise from sequential cleavage, such as thermal breakage of the secondary probe, 
generally increases Hnearly with time. In contrast, signal generation from a 2-step 
sequential reaction follows quadratic kinetics. Thus, collection of data as a time course, 
either by taking time points or through the use of an instrument that allows real-time 
detection during the INVADER assay reaction incubations, provides the attractive 
capability of discriminating between the true signal and any background solely on the 
basis of quadratic versus hnear increases in signal over time. For example, when viewed 
graphically, the real signal will appear as a quadratic curve, while any accumulating 
background will be linear, and thus easy to distinguish, even if the absolute level of the 
background signal (e.g., fluorescence in a FRET detection format) is substantial 

The sequential invasive cleavage ampUfication of the present invention can be 
used as an intermediate boost to any of the detection methods (e.g., gel based analysis by 
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either standard or by charge reversal), polymerase tailing, and incorporation into a protein 
binding region, described herein. When used is such combinations the increased 
production of a specific cleavage product in the invasive cleavage assay reduces the 
burdens of sensitivity and specificity on the read-out systems, thus facilitating their use. 
5 In addition to enabling a variety of detection platforms, the cascade strategy is 

suitable for multiplex analysis of individual analytes (i.e., individual target nucleic acids) 
in a single reaction. The multiplex format can be categorized into two types. In one case, 
it is desirable to know the identity (and amount) of each of the analytes that can be 
present in a clinical sample, or the identity of each of the analytes as well as an internal 
10 control. To identify the presence of multiple individual analytes in a single sample, 

several distinct secondary amplification systems may be included. Each probe cleaved in 
response to the presence of a particular target sequence (or internal control) can be 
designed to trigger a different cascade coupled to different detectable moieties, such as 
different sequences to be extended by DNA polymerase or different dyes in an FRET 
1 5 format. The contribution of each specific target sequence to final product can thereby be 
tallied, allowing quantitative detection of different genes or alleles in a sample containing 
a mixture of genes or alleles. 

In the second configuration, it is desirable to determine if any of several analytes 
are present in a sample, but the exact identity of each does not need to be known. For 
20 example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
is present, different signals on the probes would not be required in such an apphcation of 
the present invention, and may actually be undesirable for reasons of confidentiality. In 
this case, the 5' arms (i.e., the 5' portion that will be released upon cleavage) of the 
25 different analyte-specific probes would be identical and would therefore trigger the same 
secondary signal cascade. A similar configuration would permit multiple probes 
complementary to a single gene to be used to boost the signal fi-om that gene or to ensure 
inclusivity when there are numerous alleles of a gene to be detected. 

In the primary INVADER assay reaction, there are two potential sources of 
30 background. The fu-st is fi-om INVADER oligonucleotide-independent cleavage of probe 
annealed to the target, to itself, or to one of the other oligonucleotides present in the 
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reaction. The use of an enzyme that cannot efficiently cleave a structure that lacks a 
primer is preferred for this reason. The enzyme Pfu FEN-1 gives no detectable cleavage 
in the absence of the upstream oligonucleotide or even in the presence of an upstream 
oligonucleotide that fails to invade the probe-target complex. This indicates that the Pfu 
FEN-1 endonuclease is a suitable enzyme for use in the methods of the present invention. 

Other structure-specific nucleases may be suitable as a well. As discussed in the 
first example, some 5' nucleases can be used in conditions that significantly reduce this 
primer-independent cleavage. For example, it has been shown that when the 5' nuclease 
of DNAPTaq is used to cleave hairpins the primer-independent cleavage is markedly 
reduced by the inclusion of a monovalent salt in the reaction (Lyamichev, et al, [1993], 
supra). 

III. Effect of ARRESTOR Molecules on Signal and Background in Sequential 
Invasive Cleavage Reactions. 

As described above, the concentration of the probe that is cleaved can be used to 
increase the rate of signal accumulation, with higher concentrations of probe yielding 
higher final signal. However, the presence of large amounts of residual uncleaved probe 
can present problems for subsequent use of the cleaved products for detection or for 
further amplification. If the subsequent step is a simple detection (e.g., by gel 
resolution), the excess uncut material may cause background by streaking or scattering of 
signal, or by overwhelming a detector (e.g., over-exposing a film in the case of 
radioactivity, or exceeding the quantitative detection Umits of a fluorescence imager). 
This can be overcome by partitioning the product firom the uncut probe. In more complex 
detection methods, the cleaved product may be intended to interact with another entity to 
indicate cleavage. As noted above, the cleaved product can be used in any reaction that 
makes use of oligonucleotides, such as hybridization, primer extension, ligation, or the 
direction of invasive cleavage. In each of these cases, the fate of the residual uncut probe 
should be considered in the design of the reaction. In a primer extension reaction, the 
uncut probe can hybridize to a template for extension. If cleavage is required to reveal 
the correct 3* end for extension, the hybridized uncut probe will not be extended. It may, 
however, compete with the cleaved product for the template. If the template is in excess 
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of the combination of cleaved and uncleaved probe, then both of the latter should be able 
to find a copy of template for binding. If, however, the template is limiting, any 
competition may reduce the portion of the cleaved probe that can find successfiilly bind 
to the available template. If a vast excess of probe was used to drive the initial reaction, 
the remainder may also be in vast excess over the cleavage product, and thus may provide 
a very effective competitor, thereby reducing the amount of the final reaction (e.g,, 
extension) product for ultimate detection. 

The participation of the uncut probe material in a secondary reaction can also 
contribute to background in these reactions. While the presentation of a cleaved probe 
for a subsequent reaction may represent an ideal substrate for the enzyme to be used in 
the next step, some enzymes may also be able to act, albeit inefficiently, on the uncut 
probe as well. It was shown during the development of the present invention that 
transcription can be promoted from a nicked promoter even when one side of the nick has 
additional unpaired nucleotides. Similarly, when the subsequent reaction is to be an 
invasive cleavage, the uncleaved probe may bind to the elements intended to form the 
second cleavage structure with the cleaved probe. In experiments conducted during the 
development of the present invention, it was found that some of the 5' nucleases 
described herein can catalyze some measure of cleavage of defective structures. Even at 
a low level, this aberrant cleavage can be misinterpreted as positive target-specific 
cleavage signal. 

With these negative effects of the surfeit of uncut probe considered, there is 
clearly a need for some method of preventing these interactions. As noted above, it is 
possible to partition the cleaved product from the uncut probe after the primary reaction 
by traditional methods. However, these methods are often time consuming, may be 
expensive {e.g,, disposable columns, gels, etc.), and may increase the risk for sample 
mishandling or contamination. It is far preferable to configure the sequential reactions 
such that the original sample need not be removed to a new vessel for subsequent 
reaction. 

The present invention provides a method for reducing interactions between the 
primary probe and other reactants. This method provides a means of specifically 
diverting the uncleaved probes firom participation in the subsequent reactions. The 
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diversion is accomplished by the inclusion in the next reaction step an agent designed to 
specifically interact with the uncleaved primary probe. While the primary probe in an 
invasive cleavage reaction is discussed for reasons of convenience, it is contemplated that 
the ARRESTOR molecules may be used at any reaction step within a chain of invasive 

5 cleavage steps, as needed or desired for the design of an assay. It is not intended that the 
ARRESTOR molecules of the present invention be limited to any particular step. 

The method of diverting the residual uncut probes from a primary reaction makes 
use of agents that can be specifically designed or selected to bind to the uncleaved probe 
molecules with greater affinity than to the cleaved probes, thereby allowing the cleaved 

10 probe species to effectively compete for the elements of the subsequent reaction, even 
when the uncut probe is present in vast excess. These agents have been termed 
"ARRESTOR molecules," due to their function of stopping or arresting the primary 
probe from participation in the later reaction. In various Examples below, an 
ohgonucleotide is provided as an ARRESTOR molecule in an invasive cleavage assay. It 

15 can be appreciated that any molecule or chemical that can discriminate between the full- 
length uncut probe and the cleaved probe, and that can bind or otherwise disable the 
uncleaved probe preferentially may be configured to act as ARRESTOR molecules 
within the meaning of the present invention. For example, antibodies can be derived with 
such specificity, as can the "aptamers" that can be selected through multiple steps of z« 

20 vitro amplification {e.g., "SELEX," U.S. Patent Nos. 5,270,163 and 5,567,588; herein 
incorporated by reference) and specific rounds of capture or other selection means. 

In one embodiment, the ARRESTOR molecule is an oligonucleotide. In another 
embodiment the ARRESTOR oligonucleotides is a composite oligonucleotide, 
comprising two or more short oligonucleotides that are not covalently linked, but that 

25 bind cooperatively and are stabilized by co-axial stacking. In a preferred embodiment, 
the oligonucleotide is modified to reduce interactions with the cleavage agents of the 
present invention. When an ohgonucleotide is used as an ARRESTOR oligonucleotide, it 
is intended that it not participate in the subsequent reactive step. The binding of the 
ARRESTOR ohgonucleotide to the primary probe may, either with the participation of 

30 the secondary target, or without such participation, create a bifiircated structure that is a 
substrate for cleavage by the 5' nucleases used in some embodiments of the methods of 



115 



the present invention. Formation of such structures would lead to some level of 
unintended cleavage that could contribute to background, reduce specific signal or 
compete for the enzyme. It is preferable to provide ARRESTOR oligonucleotides that 
will not create such cleavage structures. One method of doing this is to add to the 
ARRESTOR oligonucleotides such modifications as have been found to reduce the 
activity of INVADER oligonucleotides, as the INVADER oligonucleotides occupy a 
similar position within a cleavage structure (i.e.. the 3' end of the INVADER 
oligonucleotide positions the site of cleavage of an unpaired 5' arm). Modification of the 
3' end of the INVADER oligonucleotides was examined for the effects on cleavage in the 
Example section below; a number of the modifications tested were found to be 
significantly debilitating to the fiinction of the INVADER oligonucleotide. Other 
modifications not described herein may be easily characterized by performing such a test 
using the cleavage enzyme to be used in the reaction for which the ARRESTOR 
oligonucleotide is intended. 

In a preferred embodiment, the backbone of an ARRESTOR oligonucleotide is 
modified. This may be done to increase the resistance to degradation by nucleases or 
temperature, or to provide duplex structure that is a less favorable substrate for the 
enzyme to be used {e.g., A-form duplex vs. B-form duplex). In particularly preferred 
embodiment, the backbone-modified oligonucleotide further comprises a 3' terminal 
modification. In a preferred embodiment, the modifications comprise 2' O-methyl 
substitution of the nucleic acid backbone, while in a particularly preferred embodiment, 
the 2' O-methyl modified oUgonucleotide further comprises a 3' terminal amine group. 

The purpose of the ARRESTOR ohgonucleotide is to allow the minority 
population of cleaved probe to effectively compete with the uncleaved probe for binding 
whatever elements are to be used in the next step. While an ARRESTOR ohgonucleotide 
that can discriminate between the two probe species absolutely (i.e., binding only to 
uncut and never to cut) may be of the greatest benefit in some embodiments, it is 
envisioned that in many appHcations, including the sequential INVADER assays 
described herein, the ARRESTOR ohgonucleotide of the present invention may perform 
the intended fiinction with only partial discrimination. When the ARRESTOR 
oligonucleotide has some interaction with the cleaved probe, it may prevent detection of 
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some portion of these cleavage products, thereby reducing the absolute level of signal 
generated from a given amount of target material. If this same ARRESTOR 
oligonucleotide has the simultaneous effect of reducing the background of the reaction 
{i.e., from non-target specific cleavage) by a factor that is greater than the factor of 

5 reduction in the specific signal, then the significance of the signal (i. e., the ratio of signal 
to background), is increased, even with the lower amount of absolute signal. Any 
potential ARRESTOR molecule design may be tested in a simple fashion by comparing 
the levels of background and specific signals from reactions that lack ARRESTOR 
molecules to the levels of background and specific signal from similar reactions that 

1 0 include ARRESTOR oligonucleotides. What constitutes an acceptable level of tradeoff 
of absolute signal for specificity will vary for different applications (e.g., target levels, 
read-out sensitivity, etc.), and can be determined by any individual user using the 
methods of the present invention. 

1 5 IV. Improved Enzymes For Use In INVADER Oligonucleotide-Directed 
Cleavage Reactions Comprising RNA Targets; 

A cleavage structure is defined herein as a structure that is formed by the 
interaction of a probe oUgonucleotide and a target nucleic acid to form a duplex, the 

20 resulting structure being cleavable by a cleavage agent, including but not limited to an 
enzyme. The cleavage structure is further defined as a substi-ate for specific cleavage by 
the cleavage means in contrast to a nucleic acid molecule that is a substrate for 
nonspecific cleavage by agents such as phosphodiesterases. In considering 
improvements to enzymatic cleavage agents, one may consider the action of said 

25 enzymes on any structures that fall within the definition of a cleavage structure. Specific 
cleavage at any site within such a structure is contemplated. 

Improvements in an enzyme may be an increased or decreased rate of cleavage of 
one or more types of structures. Improvements may also result in more or fewer sites of 
cleavage on one or more of said cleavage structures. In developing a library of new 

30 structure-specific nucleases for use in nucleic acid cleavage assays, improvements may 
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have many different embodiments, each related to the specific substrate structure used in 
a particular assay. 

As an example, one embodiment of the INVADER oligonucleotide-directed 
cleavage assay of the present invention may be considered. In the INVADER 
oligonucleotide-directed cleavage assay, the accumulation of cleaved material is 
influenced by several features of the enzyme behavior. Not surprisingly, the turnover 
rate, or the number of structures that can be cleaved by a single enzyme molecule in a set 
amount of time, is very important in determining the amount of material processed during 
the course of an assay reaction. If an enzyme takes a long time to recognize a substrate 
{e.g., if it is presented with a less-than-optimal structure), or if it takes a long time to 
execute cleavage, the rate of product accumulation is lower than if these steps proceeded 
quickly. If these steps are quick, yet the enzyme "holds on" to the cleaved structure, and 
does not immediately proceed to another uncut structure, the rate will be negatively 
affected. 

Enzyme turnover is not the only way in which enzyme behavior can negatively 
affect the rate of accumulation of product. When the means used to visualize or measure 
product is specific for a precisely defined product, products that deviate from that 
definition may escape detection, and thus the rate of product accumulation may appear to 
be lower than it is. For example, if one had a sensitive detector for trinucleotides that 
could not see di- or tetranucleotides, or any sized oUgonucleotide other that 3 residues, in 
the INVADER-directed cleavage assay of the present invention any errant cleavage 
would reduce the detectable signal proportionally. It can be seen from the cleavage data 
presented here that, while there is usually one site within a probe that is favored for 
cleavage, there are often products that arise from cleavage one or more nucleotides away 
from the primary cleavage site. These are products that are target-dependent, and are 
thus not non-specific background. Nevertheless, if a subsequent visualization system can 
detect only the primary product, these represent a loss of signal. One example of such a 
selective visualization system is the charge reversal readout presented herein, in which 
the balance of positive and negative charges determines the behavior of the products. In 
such a system the presence of an exfra nucleotide or the absence of an expected 
nucleotide can exclude a legitimate cleavage product from ultimate detection by leaving 
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that product with the wrong balance of charge. It can be easily seen that any assay that 
can sensitively distinguish the nucleotide content of an oligonucleotide, such as standard 
stringent hybridization, suffers in sensitivity when some fraction of the legitimate product 
is not eligible for successful detection by that assay. 

5 These discussions suggest two highly desirable traits in any enzyme to be used in 

the method of the present invention. First, the more rapidly the enzyme executes an 
entire cleavage reaction, including recognition, cleavage and release, the more signal it 
may potentially created in the INVADER oligonucleotide-directed cleavage assay. 
Second, the more successful an enzyme is at focusing on a single cleavage site within a 

10 structure, the more of the cleavage product can be successfully detected in a selective 
read-out. 

The rationale cited above for making improvements in enzymes to be used in the 
INVADER oligonucleotide-directed cleavage assay are meant to serve as an example of 
one direction in which improvements might be sought, but not as a limit on either the 

1 5 nature or the applications of improved enzyme activities. As another direction of activity 
change that would be appropriately considered improvement, the DNAP-associated 5' 
nucleases may be used as an example. In creating some of the polymerase-deficient 5' 
nucleases described herein it was found that the those that were created by deletion of 
substantial portions of the polymerase domain, assumed activities that were weak or 

20 absent in the parent proteins. These activities included the ability to cleave non-forked 
structures, a greatly enhanced ability to exonucleolytically remove nucleotides from the 
5' ends of duplexed strands, and a nascent ability to cleave circular molecules without 
benefit of a free 5' end. 

In addition to the 5' nucleases derived from DNA polymerases, the present 

25 invention also contemplates the use of structure-specific nucleases that are not derived 
from DNA polymerases. For example, a class of eukaryotic and archaebacterial 
endonucleases have been identified which have a similar substrate specificity to 5' 
nucleases of Pol I-type DNA polymerases. These are the FENl (Flap EndoNuclease), 
RAD2, and XPG (Xeroderma Pigmentosa-complementation group G) proteins. These 

30 proteins are involved in DNA repair, and have been shown to favor the cleavage of 

structures that resemble a 5* arm that has been displaced by an extending primer during 
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polymerization. Similar DNA repair enzymes have been isolated from single cell and 
higher eukaryotes and from archaea, and there are related DNA repair proteins in 
eubacteria. Similar 5' nucleases have also been associated with bacteriophage such as T5 
andT7. 

5 Recently, the 3-dimensional structures of DNAPTaq and T5 phage 5'-exonuclease 

were determined by X-ray diffraction (Kim et al.. Nature 376:612 [1995]; and Ceska et 
al. Nature 382:90 [1995]). The two enzymes have very similar 3-dimensional structures 
despite limited amino acid sequence similarity. The most striking feature of the T5 
5'-exonuclease structure is the existence of a triangular hole formed by the active site of 

10 the protein and two alpha helices. This same region of DNAPTaq is disordered in the 
crystal structure, indicating that this region is flexible, and thus is not shown in the 
published 3-dimensional structure. However, the 5' nuclease domain of DNAPTaq is 
likely to have the same structure, based its overall 3-dimensional similarity to T5 
5'-exonuclease, and that the amino acids in the disordered region of the DNAPTaq 

1 5 protein are those associated with alpha hehx formation. The existence of such a hole or 
groove in the 5' nuclease domain of DNAPTaq was predicted based on its substrate 
specificity (Lyamichev et al, supra). 

It has been suggested that the 5' arm of a cleavage structure must thread through 
the helical arch described above to position said structure correctly for cleavage (Ceska et 

20 al, supra). One of the modifications of 5' nucleases described herein opened up the 
helical arch portion of the protein to allow improved cleavage of structures that cut 
pooriy or not at all {e.g., structures on circular DNA targets that would preclude such 
threading of a 5' arm). The gene construct that was chosen as a model to test this 
approach was the one called CLEAVASE BN, which was derived from DNAPTaq but 

25 does not contain the polymerase domain. It comprises the entire 5' nuclease domain of 
DNAP Taq, and thus should be very close in structure to the T5 5' exonuclease. This 5' 
nuclease was chosen to demonstrate the principle of such a physical modification on 
proteins of this type. The arch-opening modification of the present invention is not 
intended to be limited to the 5' nuclease domains of DNA polymerases, and is 

30 contemplated for use on any structure-specific nuclease that includes such an aperture as 
a lunitation on cleavage activity. The present invention contemplates the insertion of a 
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thrombin cleavage site into the heUcal arch of DNAPs derived jfrom the genus Thermus as 
well as 5' nucleases derived from DNAPs derived from the genus Thermus, The specific 
example shown herein using the CLEAVASE BN/thrombin nuclease merely illustrates 
the concept of opening the helical arch located within a nuclease domain. As the amino 
acid sequence of DNAPs derived from the genus Thermus are highly conserved, the 
teachings of the present invention enable the insertion of a thrombin site into the helical 
arch present in these DNAPs and 5' nucleases derived from these DNAPs, 

The opening of the helical arch was accomplished by insertion of a protease site 
in the arch. This allowed post-translational digestion of the expressed protein with the 
appropriate protease to open the arch at its apex. Proteases of this type recognize short 
stretches of specific amino acid sequence. Such proteases include thrombin and factor 
Xa. Cleavage of a protein with such a protease depends on both the presence of that site 
in the amino acid sequence of the protein and the accessibility of that site on the folded 
intact protein. Even with a crystal structure it can be difficult to predict the susceptibility 
of any particular region of a protein to protease cleavage. Absent a crystal structure it 
must be determined empirically. 

In selecting a protease for a site-specific cleavage of a protein that has been 
modified to contain a protease cleavage site, a first step is to test the unmodified protein 
for cleavage at alternative sites. For example, DNAPTaq and CLEAVASE BN nuclease 
were both incubated under protease cleavage conditions with factor Xa and thrombin 
proteases. Both nuclease proteins were cut with factor Xa within the 5' nuclease domain, 
but neither nuclease was digested with large amounts of thrombin. Thus, thrombin was 
chosen for initial tests on opening the arch of the CLEAVASE BN enzyme. 

In the protease/ CLEAVASE modifications described herein the factor Xa 
protease cleaved strongly in an unacceptable position in the unmodified nuclease protein, 
in a region likely to compromise the activity of the end product. Other unmodified 
nucleases contemplated herein may not be sensitive to the factor Xa, but may be sensitive 
to thrombin or other such proteases. Alternatively, they may be sensitive to these or 
other such proteases at sites that are immaterial to the function of the nuclease sought to 
be modified. In approaching any protein for modification by addition of a protease 
cleavage site, the unmodified protein should be tested with the proteases under 
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consideration to determine which proteases give acceptable levels of cleavage in other 
regions. 

Working with the cloned segment of DNAPTaq from which the CLEAVASE BN 
protein is expressed, nucleotides encoding a thrombin cleavage site were introduced 

5 in-frame near the sequence encoding amino acid 90 of the nuclease gene. This position 
was determined to be at or near the apex of the heUcal arch by reference to both the 
3-dimensional structure of DNAPTaq, and the structure of T5 5' exonuclease. The 
encoded amino acid sequence, LVPRGS, was inserted into the apex of the helical arch by 
site-directed mutagenesis of the nuclease gene. The proline (P) in the thrombin cleavage 

10 site was positioned to replace a proline normally in this position in CLEAVASE BN 
because proline is an alpha helix-breaking amino acid, and may be unportant for the 
3-dimensional structure of this arch. This construct was expressed, purified and then 
digested with thrombin. The digested enzyme was tested for its ability to cleave a target 
nucleic acid, bacteriophage Ml 3 genomic DNA, that does not provide free 5' ends to 

15 facilitate cleavage by the threading model. 

While the helical arch in this nuclease was opened by protease cleavage, it is 
contemplated that a number of other techniques could be used to achieve the same end. 
For example, the nucleotide sequence could be rearranged such that, upon expression, the 
resulting protein would be configured so that the top of the helical arch (amino acid 90) 

20 would be at the amino terminus of the protein, the natural carboxyl and amino termini of 
the protein sequence would be joined, and the new carboxyl terminus would lie at natural 
amino acid 89. This approach has the benefit that no foreign sequences are introduced 
and the enzyme is a single amino acid chain, and thus may be more stable that the 
cleaved 5' nuclease. In the crystal structure of DNAPTaq, the amino and carboxyl 

25 termini of the 5 '-exonuclease domain lie in close proximity to each other, which suggests 
that the ends may be directly j oined without the use of a flexible linker peptide sequence 
as is sometimes necessary. Such a rearrangement of the gene, with subsequent cloning 
and expression could be accomplished by standard PGR recombination and cloning 
techniques known to those skilled in the art. 

30 The INVADER invasive cleavage reaction has been shown to be usefiil in the 

detection of RNA target strands {See e.g., U.S. Patent 6,001,567, incorporated herein by 
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reference in its entirety). As with the INVADER assay for the detection of DNA 
(Lyamichev et ai, Nat BiotechnoL, 17:292 [1999]), the reactions may be run under 
conditions that permit the cleavage of many copies of a probe for each copy of the target 
RNA present in the reaction. In one embodiment, the reaction may be performed at a 
5 temperature close to the melting temperature (Tm) of the probe that is cleaved, such that 
the cleaved and uncleaved probes readily cycle on and off the target strand without 
temperature cycling. Each time a full-length probe binds to the target in the presence of 
the INVADER ohgonucleotide, it may be cleaved by a 5' nuclease enzyme, resulting in 
an accumulation of cleavage product. The accumulation is highly specific for the 
10 sequence being detected, and may be configured to be proportional to both time and 
J target concentration of the reaction. In another embodiment, the temperature of the 

p reaction may be shifted (Le. , it may be raised to a temperature that will cause the probe to 

i dissociate) then lowered to a temperature at which a new copy of the probe hybridizes to 

:J the target and is cleaved by the enzyme. In a further embodiment, the process of raising 

^ 15 and lowering the temperature is repeated many times, or cycled, as it is in PGR (MuUis 
and Faloona, Methods in Enzymology, 155:335 [1987], Saiki et al. Science 230:1350 
111 [1985]). 

f: As noted above, 5' nucleases of Pol A type DNA polymerases are preferred for 

cleavage of an invasive cleavage structure that comprises an RNA target strand. The 
20 present invention provides enzymes having improved performance in detection assays 
based on the cleavage of a structure comprising RNA. In particular, the altered 
polymerases of the present invention exhibit improved performance in detection assays 
based on the cleavage of a DNA member of an invasive cleavage structure that comprises 
an RNA target strand. 

25 The 5' nucleases of the present invention may be derived from Pol A type DNA 

polymerases. The terminology used in describing the alterations made in this class of 5' 
nucleases relates to the descriptions of DNA polymerase structures known in the art. The 
Klenow fragment of the Pol A polymerase from E. coli (the C-terminal two thirds, which 
has the DNA synthesizing activity but lacks the 5' nuclease activity) has been described 

30 as having a physical form resembling a right hand, having an open region called the 
"palm", and a cleft that holds the primer/template duplex defined on one side by a 
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"fingers" domain and on the other by a "thumb" domain (Joyce and Steitz, Trends in 
Biochemical Science 12:288 [1987]). This is shown schematically in Figure 5. Because 
this physical form has proved to be common to all Pol A DNA polymerases and to a 
number of additional template-dependent polymerizing enzymes such as reverse 

5 transcriptases, the hand terminology has become known in the art, and the sites of activity 
in these enzymes are often described by reference to their position on the hand. For 
reference, and not intended as a limitation on the present invention, the palm is created 
from roughly the first 200 amino acids of the polymerase domain, the thumb from the 
middle 140, and the fingers by the next 160, with the base of the cleft formed from the 

10 remaining regions (Figures 6). Although some enzymes may deviate from these 

structural descriptions, the equivalent domains and sequences corresponding to such 
domains may be identified by sequence homology to known enzyme sequences, by 
comparison of enzyme crystal structures, and other like methods. 

In creating the improved enzymes of the present invention, several approaches 

1 5 have been taken, although the present invention in not limited to any particular approach. 
First two DNA polymerases, Taq and Tth, that have different rates of DNA strand 
cleavage activity on RNA targets were compared. To identify domains related to the 
differences in activity, a series of chimerical constructs was created and the activities 
were measured. This process identified two regions of the Tth polymerase that could, if 

20 transferred into the Taq polymerase, confer on the TaqPol an RNA-dependent cleavage 
activity equivalent to that of the native Tth protein. Once these regions were identified, 
the particular amino acids involved in the activity were examined. Since the two proteins 
are about 87 percent identical in amino acid sequence overall, the identified regions had 
only a small number of amino acid differences. By altering these amino acids singly and 

25 in combinations, a pair of amino acids were identified in TthPol that, if introduced into 
the TaqPol protein, increased the rate of cleavage up to that of the native TthPol. 

These data demonsfrate two important aspects of the present invention. First, 
specific amino acids can be changed to confer TthPol-like RNA-dependent cleavage 
activity on a polymerase having a lesser activity. More broadly, however, these results 

30 provide regions of these polymerases that are involved in the recognition of the 

RNA-containing cleavage structure. Identification of these important regions, combined 
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with published information on the relationships of other amino acids to the various 
functions of these DNA polymerases and computer-assisted molecular modeling during 
the development of the present invention have allowed a rational design approach to 
create additional improved 5' nucleases. The information also allowed a focused random 
mutagenesis approach coupled with a rapid screening procedure to quickly create and 
identify enzymes having improved properties. Using these methods of the present 
invention, a wide array of improved polymerases are provided. 

The methods used in creating and selecting the improved 5' nucleases of the 
present invention are described in detail below and in the experimental examples. A 
general procedure for screening and characterizing the cleavage activity of any 5' 
nuclease is included in the experimental examples. The methods discussions are divided 
into the following sections: I) Creation and selection of chimerical constructs; II) 
Site-specific mutagenesis based on information from chimerical constructs; III) 
Site-specific mutagenesis based on molecular modeling and pubHshed physical studies; 
and IV) focused random mutagenesis. 

1) Creation and selection of chimerical constructs 

The PolA-type DNA polymerases, including but not limited to DNA polymerase 
enzymes fi:om Thermus species, comprise two distinctive domams, the 5' nuclease and 
the polymerase domains, shown schematically in Figure 6. The polymerase domains 
reside in the C-terminal two-thirds of the proteins and are responsible for both 
DNA-dependent and RNA-dependent DNA polymerase activities. The N-terminal 
one-third portions contain the 5' nuclease domains. In the genus Thermus Pol A 
polymerase, the palm region consists of, roughly, amino acids 300-500, the thumb region 
includes amino acids 500-650, while the fingers region is formed by the remaining amino 
acids fi-om 650 to 830 (Figure 6). 

The derivatives, Taq DN RX HT and Tth DN RX HT, of Taq and TthPol used in 
many of the experiments of the present invention, and described herein, are modified to 
reduce synthetic activity and to facilitate chimera construction, but have 5' nuclease 
activity essentially identical to unmodified TaqPol and TthPol. Unless otherwise 
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specified, the TaqPol and TthPol enzymes of the following discussion refer to the DN RX 
HT derivative. 

TthPol has a 4-fold higher cleavage rate with the IL-6 RNA template (shown in 
Figure 10) than TaqPol (shown in Figures 1 1 and 12), although the Taq and TthPols show 
5 similarities of cleavage in DNA target structures (Figure 10). Since the amino acid 
sequences of TaqPol and TthPol (Figures 8 and 9) share about 87% identity and greater 
than 92% similarity, the high degree of homology between the enzymes allowed creation 
of a series of chimeric enzymes between TthPol and TaqPol. The activity of the chimeric 
enzymes was used as a parameter to identify the region(s) of these proteins affecting 
1 0 RNA dependent 5' nuclease activity. 

The chimeric constructs between TthPol and TaqPol genes shown schematically 
in Figures 7 and 19 were created by swapping DNA fragments defined by the restriction 
endonuclease sites, EcoRI and BamHI, common for both genes, the cloning vector site 
Sail and the new sites, NotI, BstBI and Ndel, created at the homologous positions of both 
1 5 genes by site directed mutagenesis. The restriction enzymes have been abbreviated as 
follows: EcoRI is E; NotI is N; BstBI is Bs; Ndel is D, BamHI is B, and Sail is S. 

The activity of each chimeric enzyme was evaluated using the invasive signal 
ampHfication assay with the IL-6 RNA target (Figure 10), and the cycling cleavage rates 
shown in Figure 12 were determined as described in the Experimental Examples. 
20 Comparison of the cleavage rates of the first two chimeras, TaqTth(N) and TthTaq(N), 
created by swapping the polymerase and 5' nuclease domains at the NotI site (Figure 7), 
shows that TaqTth(N) has the same activity as TthPol, whereas its counterpart TthTaq(N) 
retains the activity of TaqPol (Figure 12). This result indicates that the higher cleavage 
rate of TthPol is associated with its polymerase domain and suggests an important role of 
25 the polymerase domain in the 5' nuclease activity. 

The next step was to identify a minimal region of TthPol polymerase that would 
give rise to the TthPol-like RNA dependent 5' nuclease activity when substituted for the 
corresponding region of the TaqPol sequence. To this end, the TaqTth(N) chimera was 
selected to generate a series of new constructs by replacing its TthPol sequence with 
30 homologous regions of TaqPol. First, the N-terminal and C-terminal parts of the TaqPol 
polymerase domain were substituted for the corresponding regions of TaqTth(N) using 
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the common BamHI site as a breaking point to create TaqTth(N-B) and TaqTth(B-S) 
chimeras, respectively (Figure 7). TaqTth(N-B) which has the TthPol sequence between 
amino acids 328 and 593, is approximately 3 times more active than the TaqTth(B-S) and 
40% more active than TthPol (Figure 12). This result estabUshes that the Notl-BamHI 

5 portion of the TthPol polymerase domain determines superior RNA-dependent 5' 
nuclease activity of TthPol. 

From these data it was determined that a central portion of the TthPol, when used 
to replace the homologous portion of TaqPol (TaqTth(N-B) construct) could confer 
superior RNA recognition on the chimerical protein composed primarily of Taq protein. 

1 0 In fact, the cycling rate of this chimerical protein exceeded that of the parent TthPol. 
Comparison of chimeras that included sub-portions of the activity-improving region of 
TthPol, approximately 50% of the region in each case (See, TaqTth(N-D) and 
TaqTth(D-B), Figures 7 and 12) showed no significant improvement in RNA dependent 
activity as compared to the parent TaqPol. This result indicates that aspects of each half 

1 5 of the region are required for this activity. A construct having an only slightly smaller 
portion of the Tth insert portion (TaqTth(Bs-B)) showed activity that was close to that of 
the parent TthPol protein, but which was less than that of the TaqTth(N-B) construct. 

2) Site-specific mutagenesis based on information from chimerical 
20 constructs 

Comparison of the TthPol and TaqPol amino acid sequences between the BstBI 
and BamHI sites reveals only 25 differences (Figure 13). Among those, there are 12 
conservative changes and 13 substitutions resulting in a change in charge. Since the 
analysis of the chimeric enzymes has suggested that some critical amino acid changes are 

25 located in both BstBI-Ndel and Ndel-BamHI regions of TthPol, site directed 

mutagenesis was used to introduce the TthPol specific amino acids into the BstBI-Ndel 
and Ndel-BamHI regions of the TaqTth(D-B) and TaqTth (N-D) chimeras, respectively. 
Six TthPol-specific substitutions were generated in the BstBI-Ndel region of the 
TaqTth(D-B) by single or double amino acid mutagenesis and only one double mutation, 

30 W4 1 7L/G4 1 8K, was able to restore the TthPol activity with the IL-6 RNA target (See 
e.g.. Figure 14). Similarly, 12 TthPol specific amino acids were introduced at the 
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homologous positions of the Ndel-BamHI region of the TaqTth(N-D) and only one of 
them, E507Q, increased the cleavage rate to the TthPol level {See e.g.. Figure 14). 

To confirm that the W417L, G418K and E507Q substitutions are sufficient to 
increase the TaqPol activity to the TthPol level, TaqPol variants carrying these mutations 
5 were created and their cleavage rates with the IL-6 RNA substrate were compared with 
that of TthPol. Figure 15 shows that the TaqPol W417L/G418K/E507Q and TaqPol 
G418K/E507Q mutants have 1.4 times higher activity than TthPol and more than 4 fold 
higher activity than TaqPol, whereas the TaqPol W417L/E507Q mutant has the same 
activity as TthPol, which is about 3 fold higher than TaqPol. These results demonstrate 
1 0 that K4 1 8 and Q507 of TthPol are important amino acids in defining its superior RNA 
dependent 5' nuclease activity compared to TaqPol. 

The abihty of these amino acids to improve the RNA dependent 5' nuclease 
activity of a DNA polymerase was tested by introducing the corresponding mutations into 
the polymerase A genes of two additional organisms: Thermus filiformus and Thermus 
1 5 scotoductus. TaqPol showed improved RNA dependent activity when it was modified to 
contain the W417L and E507Q mutations, which made it more similar at these residues 
to the corresponding residues of TthPol (K418 and Q507). The TfiPol was modified to 
have P420K and E507Q, creating TfiDN 2M, while the TscPol was modified to have 
E416K and E505Q, to create TscDN 2M. The activity of these enzymes for cleaving 
20 various DNA and RNA containing structures was determined as described in Example 1, 
using the IdT2, IrT3, hairpin and X-structures diagrammed in Figures 21 and 22, with the 
results shown in both Figure 25 and Table 8. Both enzymes have much less 
RNA-dependent cleavage activity than either the TthPol or the Taq 2M enzymes. 
However, introduction of the mutations cited above into these polymerases increased the 
25 RNA dependent cleavage activity over 2 fold compared to the unmodified enzymes 

(Figure 25). These results demonstrate that transferability of improved RNA dependent 
cleavage activity into a wide range of polymerases using the methods of the present 
invention. 
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3) Site-specific mutagenesis based on molecular modeling and published 
physical studies 

The positions of the G418H and E507Q mutations in the crystal structure of a 
complex of the large fragment of TaqPol (Klentaql) with a primer/template DNA 
5 determined by Li et al (Li et al. Protein Sci,, 7:11 16 [1998]) are shown in Figure 17. 
The E507Q mutation is located at the tip of the thumb subdomain at a nearest distance of 
3.8 A and 18 A from the backbone phosphates of the primer and template strands, 
respectively. The interaction between the thumb and the minor groove of the DNA 
primer/template was previously suggested by the co-crystal structures of Klenow 

10 fragment DNA polymerase I (Breese et al. Science 260:352 [1993]) and TaqPol (Eom et 
al. Nature 382:278 [1996]). Deletion of a 24 amino acid portion of the tip of the thumb 
in Klenow fragment, corresponding to amino acids 494-518 of TaqPol, reduces the DNA 
binding affinity by more than 100-fold (Minnick et al, J. Biol. Chem., 271:24954 
[1996]). These observations are consistent with the hypothesis that the thumb region, 

15 which includes the E507 residue, is involved in interactions with the upstream substrate 
duplex. 

The W417L and the G418K mutations in the palm region of TaqPol (Figure 17) 
are located approximately 25 A from the nearest phosphates of the template and upstream 
strands, according to the co-crystal structures of TaqPol with duplex DNA bound in the 

20 polymerizing mode (Li et al. Protein Sci., 7:1 1 16 [1998], Eom et al. Nature 382:278 

[1996]). The same distance was observed between the analogous W513 and P514 amino 
acids of Klenow fragment and the template strand of DNA bound in the editing mode 
(Breese et al. Science 260:352 [1993]), Thus, no interactions between TaqPol and the 
overlapping substrate can be suggested from the available co-crystal studies for this 

25 region. 

Although an understanding of the mechanism of action of the enzymes is not 
necessary for the practice of the present invention and the present invention is not limited 
to any mechanism of action, it is proposed that the amino acids at positions 417 and 418 
in the palm region of TaqPol interact with the upstream substrate duplex only when the 
30 enz)Tiie ftmctions as a 5* nuclease, but no interaction with these amino acids occurs when 
TaqPol switches into polymerizing mode. This hypothesis suggests a novel mode of 
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substrate binding by DNA polymerases called here the "5* nuclease mode." Several lines 
of evidence support this hypothesis. The study of the chimeric enzymes described here 
clearly separates regions of the polymerase domain involved in the 5' nuclease and 
polymerase activities. Accordingly, the W417L and G418K mutations, together with the 

5 E507Q mutation, affect the 5' nuclease activity of TaqPol on substrates having an RNA 
target strand (Figure 15), but have no effect on either RNA-dependent or DNA-dependent 
DNA polymerase activities (Figure 16), On the other hand, mutations in the active site of 
TaqPol, such as R573A, R587A, E615A, R746A, N750A and D785N, which correspond 
to substitutions in Klenow fragment oiExoli DNA Pol I that affect both polymerase 

1 0 activity and substrate binding affinity in the polymerizing mode (Polesky et alj. Biol. 
Chem., 265:14579 [1990], Polesky et al, J. Biol. Chem., 267:8417 [1992], Pandey et aL, 
Eur. J. Biochem., 214:59 [1993]) were shown to have little or no effect on the 5* nuclease 
activity. Superposition of the polymerase domains of TaqPol (Eom et al. Nature 
382:278 [1996]), Exoli Pol I and Bacillus stearothermophilus Pol I (Kiefer et al. Nature 

15 391:304 [1998]) using the programs DALI (Holm and Sander, J. Mol. Biol, 233:123 

[1993], Holm and Sander, Science 273:595 [1996]) and Insight II (Molecular Simulation 
Inc., Naperville, IL) shows that the palm region of TaqPol between amino acids 402-451, 
including W417 and G418, is structurally highly conserved between the three 
polymerases, although there is no structural similarity between the rest of the palm 

20 subdomains. This observation suggests an important role for this region in eubacterial 
DNA polymerases. 

The 5' nuclease and polymerase activities should be precisely synchronized to 
create a nicked structure rather than a gap or an overhang that could cause a deletion or 
an insertion during Okazaki fragment processing or DNA repair, if ligase joins the ends 

25 inappropriately. According to the previously proposed model (Kaiser et al, J. Biol. 
Chem., 274:21387 [1999]), the 3* terminal nucleotide of the upstream strand is 
sequestered by the 5' nuclease domain to prevent its extension, thus halting synthesis. 
The interaction with the 3' nucleotide apparently activates the 5' nuclease that 
endonucleolitically removes the displaced 5' arm of the downstream strand. This 

30 cleavage occurs by the precise incision at the site defined by the 3' nucleotide, thus 
creating the nick. This model requires a substantial rearrangement of the 
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substrate-enzyme complex, which may include a translocation of the complex to the 5' 
nuclease mode to separate the primer/template from the polymerase active site. 

It is possible that a relocation of the substrate away from the polymerase active 
site could be induced by the interaction between the duplex formed between the template 
and incoming strands and the crevice formed by the finger and thumb subdomains. Such 
an interaction could force conformational transitions in the thumb that would bring the 
template/primer duplex into close contact with the W417 and G418 amino acids. 
Significant flexibility of the thumb has been previously reported that might explain such 
changes (Beese et ai. Science 260:352 [1993], Eom et al. Nature 382:278 [1996], OUis 
et al. Nature 313:762 [1985], Kim et a/., Nature 376:612 [1995], Korolev et al, Proc. 
Natl. Acad. Sci., 92:9264 [1995], Li et al, EMBO J., 17:7514 [1998]). Additional 
conformational changes of the fingers domain that might help to open the crevice, such as 
the transition from the 'closed' to the 'open' structure described by Li et al (Li et al, 
EMBO J., 17:7514 [1998]), are consistent with this model. It may be that the 5' nuclease 
binding mode was not observed in any of the published co-crystal structures of a DNA 
Pol I because the majority of the structures were solved for the polymerase domain only, 
with a template/primer subsfrate rather than with an overlapping 5' nuclease substrate. 

Km values of 200-300 nM have been determined for TaqPol, TthPol and TaqPol 
G418K/E507Q for the RNA containing substrate. These values are much higher than the 
Km value of <1 nM estimated for TthPol with an all-DNA overlapping substrate 
suggesting that the RNA template adversely affects substrate binding. The low affinity 
could be explained by the unfavorable interaction between the enzyme and either the 
A-form duplex adopted by the substrate with an RNA target, or the ribose 2' hydroxyls of 
the RNA sfrand. Between these two factors, the latter seems more likely, since the 5' 
nucleases of eubacterial DNA polymerases can efficiently cleave substrates with an RNA 
downstream probe (Lyamichev et al. Science 260:778 [1993]), which would presumably 
have an A-form. Further, the co-crystal studies suggest that the template/primer duplex 
partially adopts a conformation close to A-form in its complex with DNA polymerase 
(Eom et al. Nature 382:278 [1996], Kiefer et al, Nature 391 :304 [1998], Li et al, 
EMBO J., 17:7514 [1998]). The G418K/E507Q mutations increase the kcat of TaqPol 
more than two fold, but have little effect on Km- Such an effect would be expected if the 
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mutations position the substrate in an orientation more appropriate for cleavage rather 
than simply increasing the binding constant. 

In addition to the mutational analysis described above, another approach to 
studying specific regions of enzymes, enzyme structure-function relationships, and 
5 enzyme-substrate interaction is to investigate the actual, physical structure of the 
molecule. 

With the advances in crystallographic, NMR, and computer and software 
technology, study of molecular structure has become a viable tool for those interested in 
the configuration, organization, and dynamics of biomolecules. Molecular modeling has 

10 increased the understanding of the nature of the interactions that underlie the structure of 
proteins and how proteins interact functionally with substrate. Numerous pubhcations 
describing the structures of various polymerases or polymerase protein portions, HIV 
reverse transcriptase, and other nucleic acid binding proteins have provided mechanistic 
insights into protein conformation, changes in conformation, and molecular interactions 

1 5 necessary for function. 

As an example, the report by DoubUe et al (Doublie et aL, Nature 391:251 
[1998]) discloses the crystal structure of T7 DNA polymerase and provides information 
about which amino acid regions are likely to have an affect on substrate binding, which 
are required to contact the substrate for polymerization, and which amino acids bind 

20 cofactors, such as metal ions. It is noted in this paper and others that many of the 

polymerases share not only sequence similarity, but structural homology as well. When 
certain structural domains of different polymerases are superimposed (for example, T7 
polymerase, Klenow fragment editing complex, the unliganded Taq DNA polymerase 
and the Taq Polymerase-DNA complex) conserved motifs are clearly discemable. 

25 Specifically, combining the information from all of these different structural 

sources and references, a model of the protein interacting with DNA, RNA, or 
heteroduplex can be made. The model can then be examined to identify amino acids that 
may be involved in substrate recognition or substrate contact. Changes in amino acids 
can be made based on these observations, and the effects on the various activities of the 5* 

30 nuclease proteins are assessed using screening methods such as those of the present 
invention, described in the experimental examples. 
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The domain swapping analysis discussed previously demonstrated that sequences 
of TthDN that are important in RNA-dependent 5' nuclease activity lie in the polymerase 
domain of the protein. Therefore, study of structural data of the polymerase domain with 
respect to nucleic acid recognition provides one method of locating amino acids that, 
5 when altered, aUer RNA recognition in a 5' nuclease reaction. For example, analysis 
conducted during the development of the present invention examined published analyses 
relating to primer/template binding by the polymerase domain of E. coli Pol 1, the 
Klenow fragment. Table 2 shows a sampling of kinetic constants determined for the 
Klenow fragment, and shows the effects a number of mutations on these measurements. 

10 The corresponding or similarly positioned amino acids in the TaqPol are indicated in the 
right hand column. It was postulated that mutations having a noticible impact on the 
interactions of the Klenow fragment with the DNA template or the primer/template 
duplex, as indicated by changes in Kd and Relative DNA affinity values, might also have 
effects when made at the corresponding sites in TaqPol and related chimerical or mutant 

1 5 derivatives. A selection of the mutations that produced a higher value or a lower 

Relative DNA affinity value when introduced into the Klenow fragment were created and 
examined in TaqPol. These Taq derivatives include, but are not limited to, those 
indicated by asterisks in the right hand column of Table 2. 

For some Klenow variants, such as the R682 mutants, selection for testing was 

20 not made based on the DNA affinity measurements, but because molecular modeling 

suggested interaction between some aspect of the template/primer duplex and that amino 
acid. Similarly, additional regions of Taq poljonerase (or Taq derivatives) were targeted 
for mutagenesis based on structural data and information from molecular modeling. 
Based on modeling, the thumb region was postulated to contact an RNA template. Thus, 

25 amino acids in the thumb region were looked for that, if altered, might alter that contact. 
For example, Figures 6 and 17 show that amino acids 502, 504, and 507 are located at the 
tip of the thumb. It was postulated that altering these amino acids might have an affect 
on the enzyme-substrate interaction. Using the activity screening methods described In 
Example 1, mutations that produced beneficial effects were identified. This approach 

30 was used to create a number of improved enzymes. For example, TaqPol position H784, 
corresponding to Klenow amino acid H881, is an amino acid in the fingers region and, as 
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such, may be involved in primer/template substrate binding. When the H881 amino acid 
in the Klenow enzyme is replaced by alanine, the change decreases the affinity of the 
enzyme for DNA to only 30 to 40% of the wild type level An analogous substitution 
was tested in a TaqPol-derived enzyme. Starting with the Taq derivative 
W417L/G418K/E507Q, amino acid 784 was changed from Histidine (H) to Alanine (A) 
to yield the W417L/G418K/E507Q/H784A mutant, termed Taq 4M. This variant showed 
improved 5' nuclease activity on the RNA test IrTl (Figure 24) test substrate (data in 
Table 3). Amino acid R587 is in the thumb region, and was selected for mutation based 
on its close proximity to the primer/template duplex in computer models. When an 
R587A mutation was added to the Taq 4M variant, the activity on the test IrTl test 
substrate was still further improved. In addition, the reduction, relative to the 4M variant, 
in cleavage of the X structure shown in Figure 22 constitutes an additional improvement 
in this enzyme's function. 

Not all amino acid changes that reduce DNA binding in the polymerization affect 
the 5' nuclease activity. For example, mutations E615A, R677A, affecting amino acid 
that are also in the thumb and fingers domains, respectively, have either adverse effect, or 
no effect on the 5' nuclease activities, respectively, as measured using the test substrates 
in Figures 21 and 22, and compared to the parent variants that lacked these changes. The 
R677A mutation was added to, and compared with the TaqSS variant, while the E615A 
mutation was added to and compared with the Taq 4M variant. The test methods 
described herein provide a convenient means of analyzing any variant for the alterations 
in the cleavage activity of both invasive an noninvasive substrates, for both DNA and 
RNA containing structures. Thus, the present invention provides methods for identifying 
all suitable improved enzymes. 

Alterations that might increase the affinity of the enzymes for the nucleic acid 
targets were also examined. Many of the mutations described above were selected 
because they caused the Klenow fragment enzyme to have decreased affinity for DNA, 
with the goal of creating enzymes more accepting of structures containing non-DNA 
strands. In general, the native DNA polymerases show a lower affinity for RNA/DNA 
duplexes, compared to their affinity for DNA/DNA duplexes. During the development of 
the present invention, it was sought to increase the general affinity of the proteins of the 
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present invention for a nucleic acid substrate without restoring or increasing any 
preference for structures having DNA rather than RNA target strands. The substitution of 
amino acids having different charges was examined as a means of altering the interaction 
between the proteins and the nucleic acid substrates. For example, it was postulated that 
addition of positively charged amino acid residues, such as lysine (K), might increase the 
affinity of a protein for a negatively charged nucleic acid. 

As noted above, alterations in the thimib region could affect the interactions of the 
protein with the nucleic acid substrate. In one example, the mutation G504K (tip of the 
thumb) was introduced in Taq4M and caused and enhancement of nuclease activity by 
15% on an RNA target. Additional positively charged mutations (A502K and E507K) 
further improve the RNA target dependent activity by 50% compared to the parent 
Taq4M enzyme. 

The use of data from published studies and molecular modeling, in combination 
with results accrued during the development of the present invention allowed the 
identification of regions of the proteins in which changes of amino acids would be likely 
to cause observable differences in at least one aspect of cleavage function. While regions 
could be targeted in this way, it was observed that changes in different amino acids, even 
if near or immediate neighbors in the protein, could have different effects. For example, 
while the A502K substitution created a marked increase in the RNA-dependent cleavage 
activity of Taq 4M, changing amino acid 499 from G to a K, only 3 amino acids away 
from 502, gave minimal improvement. As can be seen in the Experimental Examples, 
the approach of the present invention was to change several amino acids in a candidate 
region, either alone or in combination, then use the screening method provided in 
Example 1 to rapidly assess the effects of the changes. In this way, the rational design 
approach is easily appUed to the task of protein engineering. 

In addition to the thumb, palm, and hand regions found in the polymerase domain 
of these proteins, regions that are specific to 5' nucleases and nuclease domains were 
examined. Comparative studies on a variety of 5' nucleases have shown that, though the 
amino acid sequences vary dramatically from enzyme to enzyme, there are structural 
features common to most. Two of these features are the helix-hairpin-helix motif 
(H-h-H) and the arch or loop stiricture. The H-h-H motif is beheved to mediate 
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non-sequence specific DNA binding. It has been found in at least 14 families of proteins, 
including nucleases, N-glycosylases, ligases, helicases, topoisomerases, and polymerases 
(Doherty et al, Nucl. Acid. Res., 24:2488 [1996]). The crystallographic structure of rat 
DNA polymerase pol p bound to a DNA template-primer shows non-specific hydrogen 
bonds between the backbone nitrogens of the pol p HhH motif and the phosphate 
oxygens of the primer of the DNA duplex (Pelletier et al. Science 264:1891 [1994]), 
Because the HhH domain of 5' nuclease domains of Taq and Tth polymerases may 
function in a similar manner, it is contemplated that mutations in the HhH region of the 
enzyme alter activity. Mutations may be introduced to alter the shape and structure of the 
motif, or to change the charge of the motif to cause increased or decreased affinity for 
substrate. 

Another structure common to many 5' nucleases fi-om diverse sources such as 
eukaryotes, eubacteria, archaea and phage, is the arch or loop domain. The crystal 
structure of the 5' exonuclease of bacteriophage T5 showed a distinct arch formed by two 
helices, one positively charged and one containing hydrophobic residues (Ceska et al. 
Nature 382:90 [1996]). Interestingly, three residues that are conserved between T5 and 
Taq, Lys 83, Arg 86 and Tyr 82 are all in the arch. These correspond to amino acids Lys 
83, Arg 86, and Tyr 82 in Taq DNA polymerase. The crystal structure for Taq (5' 
nuclease) has also been determined (Kim et al. Nature 376:612 [1995]). 

The crystal structure from the flap endonuclease-1 from Methanococcus 
janneschii also shows such a loop motif (Hwang et al, Nat. Struct. Biol, 5:707 [1998]). 
The backbone crystal structure of Mja FEN-1 molecules may be superimposed on T5 
exonuclease, Taq 5 '-exonuclease and T4 RNase H, An interesting feature common to all 
of these is the long loop. The loop of FEN-1 consists of a number of positively charged 
and aromatic residues and forms a hole with dimensions large enough to accommodate a 
single-stranded DNA molecule. The corresponding region in T5 exonuclease consists of 
three helices forming a helical arch. The size of the hole formed by the helical arch in T5 
exonuclease is less than half that formed by the LI loop in Mj FEN-1 . In T4 RNase H or 
Taq 5' exonuclease, this region is disordered. Some regions of the arch bind metals, 
while other regions of the arch contact nucleic acid substrate. Alignment of the 
amino-acid sequences of six 5' nuclease domains from DNA polymerases in the pol I 
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family show six highly conserved sequence motifs containing ten conserved acidic 
residues (Kim et al. Nature 376 [1995]). 

The effects of alterations in the arch region were examined. In Taq polymerase 
the arch region is formed by amino acids 80-95 and 96-109. Site directed mutagenesis 
was performed on the arch region. Alignment of amino acid sequences of the FEN and 
polymerase 5' nucleases suggested the design of 3 amino acid substitution mutations, 
P88E, P90E and G80E. These substitutions were made on the Taq4M polymerase mutant 
as a parent enzyme. Results indicated that although the background activity on the HP 
and X substrates shown in Figure 22 are tremendously suppressed in all mutants, the 
desirable 5' nuclease activity on proper substrates (IdT and IrT, Figure 24) is also 
reduced. Despite the sequence homology between Taq and Tth polymerases, they have 
very different activity on HP and X substrates. The alignment of the Taq and Tth 
polymerase arch regions also demonstrates regions of extensive sequence homology as 
well as minor differences. These differences led to the design of mutations L109F and 
Al lOT using Taq4M to generate Taq4M L109F/A1 lOT, and the mutant Taq 4M 
A502K/G504K/E507KyT514S to generate Taq 4M 

L109F/A1 10T/A502K/G504K/E507K/T514S mutant. These two mutations have 
drastically converted Taq4M enzyme to become more like Tth enzyme in terms of the 
background substrate specificity while the 5' nuclease activities on both DNA and RNA 
substrates are almost unchanged. 

4) Focused random mutagenesis 

As described above, physical studies and molecular modeling may be used alone 
or in combination to identify regions of the enzymes in which changes of amino acids are 
likely to cause observable differences in at least one aspect of cleavage function. In the 
section above, use of this information was described to select and change specific amino 
acids or combinations of amino acids. Another method of generating an enzyme with 
altered function is to introduce mutations randomly. Such mutations can be introduced 
by a number of methods known in the art, including but not limited to, PGR ampUfication 
under conditions that favor nucleotide misincorporation (REF), ampUfication using 
primers having regions of degeneracy {i.e., base positions in which different individual. 
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but otherwise similar oligonucleotides in a reaction naay have different bases), and 
chemical synthesis. Many methods of random mutagenesis are known in the art (Del Rio 
et al., Biotechniques 17:1 132 [1994]), and may be incorporated into the production of the 
enzymes of the present invention. The discussions of any particular means of 
mutagenesis contained herein are presented solely by way of example and not intended as 
a limitation. When random mutagenesis is performed such that only a particular region 
of an entire protein is varied, it can be described as "focused random mutagenesis." As 
described in the Experimental Examples, a focused random mutagenesis approach was 
apphed to vary the HhH and the thumb domains some of the enzyme variants previously 
created. These domains were chosen to provide examples of this approach, and it is not 
intended that the random mutagenesis approach be limited to any particular domain, or to 
a single domain. It may be applied to any domain, or to any entire protein. Proteins thus 
modified were tested for cleavage activity in the screening reactions described in 
Example 1, using the test substrates diagrammed in Figures 22 and 24, with the results 
described in Tables 6 and 7. 

Random mutagenesis was performed on the HhH region with the parent TaqSS or 
TthDN H785A mutants. None of the 8 mutants generated showed an improvement in 
activity compared to the parent enzyme (Table 6). In fact, mutations of the region 
between residues 198-205 have about 2-5 fold lower activity on both DNA and RNA 
substrates, suggesting that this region is essential for substrate recognition. Mutagenesis 
in the thumb region resulted in new mutations that improved 5' nuclease activity by 
20-100% on a DNA target and about 10% on an RNA target (Table 7). 

Numerous amino acids in each of the distinct subdomains play roles in substrate 
contact. Mutagenesis of these may alter substrate specificity by altering substrate 
binding. Moreover, mutations introduced in amino acids that do not directly contact the 
substrate may also alter substrate specificity through longer range or general 
conformation altering effects. These mutations may be introduced by any of several 
methods known in the art, including, but not limited to random mutagenesis, site directed 
mutagenesis, and generation of chimeric proteins. 

As noted above, numerous methods of random mutagenesis are known in the art. 
The methods applied in the focused random mutagenesis described herein may be applied 
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to whole genes. It is also contemplated that additional useful chimerical constructs may 
be created through the use of molecular breeding (See e.g., U.S. Pat. No. 5,837,458 and 
PCX Publications WO 00/18906, WO 99/65927, WO 98/31837, and WO 98/27230, 
herein incorporated by reference in their entireties). Regardless of the mutagenesis 
method chosen, the rapid screening method described herein provides a fast and effective 
means of identifying beneficial changes within a large collection of recombinant 
molecules. This makes the random mutagenesis procedure a manageable and practical 
tool for creating a large collection of altered 5' nucleases having beneficial 
improvements. The cloning and mutagenesis strategies employed for the enzymes used as 
examples are applicable to other thermostable and non-thermostable Type A 
polymerases, since DNA sequence similarity among these enzymes is very high. Those 
skilled in the art would understand that differences in sequence would necessitate 
differences in cloning strategies, for example, the use of different restriction 
endonucleases may be required to generate chimeras. Selection of existing alternative 
sites, or introduction via mutagenesis of alternative sites are well established processes 
and are known to one skilled in the art. 

Enzyme expression and purification can be accomplished by a variety of 
molecular biology methods. The examples described below teach one such method, 
though it is to be understood that the present invention is not to be limited by the method 
of cloning, protein expression, or purification. The present invention contemplates that 
the nucleic acid construct be capable of expression in a suitable host. Numerous methods 
are available for attaching various promoters and 3' sequences to a gene structure to 
achieve efficient expression. 

5) Site-Specific mutagenesis 

In some embodiments of the present invention, any suitable technique (e.g., 
including, but not limited to, one or more of the techniques described above) are used to 
generate improved cleavage enzymes (e.g., SEQ ID NO:221) with heterologous domains. 
Accordingly, in some embodiments, site-specific mutagenesis (e.g., primer-directed 
mutagenesis using a commercially available kit such as the Transformer Site Directed 
mutagenesis kit (Clontech)) is used to make a plurality of changes thoughout a nucleic 
acid sequence in order to generate nucleic acid encoding a cleavage enzyme of the 
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present invention. Insome embodiments, a plurality of primer-directed mutagenesis steps 
are carried out in tandem to produce a nucleic acid encoding a cleavage enzyme of the 
present invention. 

In some embodiments, a plurality of primer directed mutagenesis steps are 

5 directed to a selected portion of a nucleic acid sequence, to produce changes in a selected 
portion of a cleavage enzyme of the present invention. In other embodiments, a nucleic 
acid having changes in one selected portion is recombined with a nucleic acid having 
mutations in a different selected portion {e.g., through cloning, molecular breeding, or 
any of the other recombination methods known in the art), thereby creating a nucleic acid 

1 0 having mutations in a plurality of selected portions, and encoding a cleavage enzyme of 
the present invention. The mutations in each selected portion may be introduced by any 
of the methods described above, or any combination of said methods, including but not 
limited to methods of random mutagenesis and site-directed mutagenesis. 

For example, in one illustrative embodiment of the present invention, the nucleic 

1 5 acid sequence of SEQ ID NO:222 (a nucleic acid sequence encoding the cleavage 
enzyme of SEQ ID NO:221) is generated by making a plurality of primer-directed 
mutations to the nucleic acid sequence of SEQ ID NO: 104 (see Example 7 for the 
construction of SEQ ID NO:104). In some embodiments, each mutation is introduced 
using a separate mutagenesis reaction. Reactions are carried out sequentially such that 

20 the resulting nucleic acid (SEQ ID NO: 222) contains all of the mutations. In another 
illustrative embodiment of the present invention, the nucleic acid sequence of SEQ ID 
NO:222 is generated by making a plurality of primer-directed mutations, as described 
above, in the nuclease portion (e.g., as diagrammed in Figure 6) of SEQ ID NO: 111. The 
mutant nuclease portion is then combined with the "polymerase" portion of SEQ ID 

25 NO: 1 04 at the Not I site, using the recombination methods described in Example 4, 

thereby creating a single nucleic acid having SEQ ID NO:222, and encoding the cleavage 
enzyme of SEQ ID NO:221 . Following mutagenesis, the resulting altered polypeptide is 
produced and tested for cleavage activity using any suitable assay (e.g., including, but not 
limited to, those described in Examples 1 and 6). In some embodiments, the nucleic acid 

30 sequence encoding the cleavage enzyme of SEQ ID NO:221 (e.g., SEQ ID NO:222) is 
further modified using any suitable method. 
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V. Reaction Design for INVADER Assay Detection of RNA Targets; 

Approaches to designing INVADER assays for the detection of RNA targets can 
vary depending on the needs of a particular assay. For example, in some embodiments^ 
5 an RNA to be detected or analyzed may be present in a test sample at low levels, so a 
high level of sensitivity (/.e,, a low limit of detection, or LOD) may be desirable; in other 
embodiments, an RNA may abundant, and may not require an especially sensitive assay 
for detection. In some embodiments, an RNA to be detected may be similar to other 
RNAs in a sample that are not intended to be detected, so that a high level of selectivity 

10 in an assay is desirable, while in other embodiments, it may be desired that multiple 
similar RNAs be detected in a single reaction, so an assay may be provided that is not 
selective with respect to the differences among these similar RNAs. 

In some embodiments it is especially desirable to avoid detection of any DNA 
molecules related to the target RNA molecules in a reaction. In some embodiments, this 

15 is accomplished by designing INVADER assay probe sets to RNA splice junctions, such 
that only the properly spliced mRNAs provide the selected target sites for detection. In 
other embodiments, samples are handled such that DNA remains double stranded {e.g., 
the nucleic acids are not heated or otherwise subjected to denaturing conditions), and is 
thus not available to serve as target in an INVADER assay reaction. In other 

20 embodiments, cells are lysed under conditions that leave nuclei intact, thereby containing 
and preventing detection of the genomic DNA, while releasing the cytosolic mRNAs into 
the lysate solution for detection by the assay. 

In some embodiments, the INVADER assay is to be used for detection or 
quantitation of an entire RNA having a particular variation of a sequence (e.g,, a mutation 

25 a SNP, a particular spUced junction); in such embodiments, the location of the base or 
sequence to be detected is a determining factor in the selection of a site for the 
INVADER assay probe set to hybridize. In other embodiments, any portion of an RNA 
target may be used to indicate the presence or the amount of the entire RNA (e.g., as in 
gene expression analysis). In this case, the probe sets may be directed toward a portion 

30 of the RNA selected for optimal performance (e.g., sites determined to be particularly 
accessible for probe hybridization) as a target in the INVADER assay. 
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The discussion of INVADER assay probe design is divided into the following sections: 

i. Target site selection based on accessibility 

ii. Target site selection based on selectivity 
5 iii. Oligonucleotide design 

a Target-specific regions: length and melting temperature 

b. Non-complementary regions 

c. Folding and dimer analysis 
iv. Assay performance evaluation 

10 v. Design and assay optimization 



i. Target site selection based on accessibility 

One consideration in the selection of sites for detection is the availability of the 
target site for hybridization of the assay probe set. To simply use randomly selected 

15 complementary oligonucleotides for a given RNA target without prior knowledge of 

regions of the RNA that allow efficient hybridization can be an ineffective approach. For 
example, it is estimated that targeting RNA with antisense oligonucleotides based on 
random design results in one out of 18-20 tested oligonucleotides showing significant 
inhibition of gene expression (Sczakiel, Fronteirs in Biosciences 5:194 [2000]; Patzel et 

20 al. Nucleic Acids Res,, 27:4328 [1999]; Peyman et al, Bioh Chem. Hoppe-Seyler 
367:195 [1995]; Monia et ah. Nature Med,, 2:668 [1996]). Secondary and tertiary 
structures of RNA are thought to be the major reasons that influence the ability of an 
oligonucleotide to bind targeted regions of the RNA (Vickers et al. Nucleic Acids Res., 
28:1340 [2000]; Lima et al. Biochemistry 31:12055 [1992]; Uhlenbeck, J. MoL Biol., 

25 65:25 [1972]; Freier and Tinoco, Biochemistry 14:3310 [1975]), This is due to the 
hybridization kinetics and thermodynamics of destroying any structural motifs of the 
RNA and, in return, hybridizing the complementary DNA oligonucleotide (Patzel et al. 
Nucleic Acids Res,, 27:4328 [1999]; Mathews etal, RNA 5:1458 [1999]). Thus, the 
ability to identify regions of RNA that are "accessible" for hybridization is important for 

30 design and selection of effective oligonucleotides. 
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There are several experimental and theoretical methods available for identifying 
accessible regions in RNA. These include the use of RNase-H footprinting (Ho et al. 
Nature Biotechnology 16:59 [1998]; Mateeva et al. Nucleic Acids Res., 25:5010 [1997]; 
Mateeva et al. Nature Biotechnology 16:1374 [1998]), complementary arrays of 

5 oUgonucleotide libraries (Southern et al, Nucleic Acids Res., 22:1368 [1994]; Mir and 
Southern, Nature Biotechnology 17:788 [1999]), ribozyme libraries with random 
hexamer internal guide sequences (Campbell and Cech, RNA 1:598 [1995]; Lan et al. 
Science 280:1593 [1998]), and RNA and DNA structure prediction computer programs 
(Sczakiel, Frontiers in Biosciences 5:194 [2000]; Patzel et al, Nucleic Acids Res., 

10 27:4328 [1999]; Zuker, Science 244:48 [1989]; Walton et al, Biotechnol. Bioeng., 65: 1 
[1999]). Recently, new methods have been developed that use primer extension to 
identify sites in RNAs that are accessible for hybridization. Target nucleic acids {e.g., 
mRNA target nucleic acids) are contacted with a plurality of primers containing a 3' a 
region of degenerate sequence and primer extension reactions are conducted. Where the 

15 target nucleic acid is an RNA molecule, preferred enzymes for use in the extension 
reactions are reverse transcriptases, which produce a DNA copy of the RNA template. 
Folded structures present in the target nucleic acid affect the initiation and/or efficiency 
of the extension reaction. The extension products of the primers are analyzed to provide 
a map of the accessible sites. For example, certain extension products are not generated 

20 where the primer is complementary to a sequence that is involved in a folded structure. 
Regions of the target nucleic acid that do not allow hybridization of the primer and do not 
result in the production of an extension product are considered inaccessible sites. In 
contrast, the presence of an extension product indicates that the primer was able to bind 
to an accessible region of the target nucleic acid. Such methods are referred to herein as 

25 "reverse transcription with random oligonucleotide libraries" or "RT-ROL" (HT AUawi, 
et al, RNA 7(2):314-27 [2001]). The use of aphysical measurement such as RT-ROL or 
array hybridization provides the most direct evidence of the accessibility of a site on an 
RNA strand. In general, INVADER assays directed toward accessible regions produce 
stronger signals for a given amount of RNA than assays directed toward less accessible 

3 0 regions of an RNA strand. For the detection of rare RNAs {e.g. , fewer than about 5 ,000 
to 10,000 copies per INVADER assay reaction), or in any assay wherein it is desirable to 
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have the best {Le., lowest) limit of detection possible, it may be beneficial to start the 
assay design by analyzing the RNA structure using RT-ROL or another method of 
physical analysis. 

In other embodiments, ease of assay design may be more important than creation 
of an assay with a particularly low LOD, Structure prediction software can simplify the 
task of determining which parts of an RNA are likely to be single stranded, and thus be 
more accessible for probe hybridization. As first step, the sequence of an RNA to be 
detected is entered into an electronic file. It may be entered manually or imported from a 
file (e.g,, a sequence data file, or a word processing file). In some embodiments, the 
sequence is downloaded from a database, such as GenBank or EMBL. The RNA 
sequence can then be analyzed using a program such as mfold (Zucker, Science 244:48 
[1989]), OUgoWalk (Mathews et al, RNA 5:1458 [1999]), and variations of both 
(Sczakiel, Frontiers in Biosciences 5:194 [2000]; Patzel et al. Nucleic Acids Res., 
27:4328 [1999]; Walton et ai, Biotechnol. Bioeng., 65:1 [1999]), 

Mfold Analysis for target RNA structure prediction. 

The output of mfold analysis can be used in several ways to assist in identifying 
accessible regions of an RNA target molecule. In one embodiment, the mfold program is 
used to generate an "ss count" file for identifying regions least likely to be involved in 
intra-strand baseparing. In another embodiment, the mfold program is used to generate a 
'',ct" file, a file used as input information for use with RNA Structure 3,5 to perform an 
OligoWalk analysis. In preferred embodiments, for either use, the sequence to be 
detected is entered into mfold. In a preferred embodiment, the settings used in the mfold 
analysis include: 

• Folding Temperature: 37°C (Even though the INVADER reaction may not be 
conducted at this temperature.) 

• % Suboptimality: 5 

• # foldings: 50 

• Window Parameter: Default 

• Maximum distance between paired bases: No Limit 

• Select BATCH folding 
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• Enter: an e mail address where the results are to be sent when ready 

• Image Resolution: High 

• Structure Format: Bases 

• Base Number Frequency:Default 

• Structure Rotation Angle: 0 

• Structure Annotation: SS-Count 

• 1 M NaCl (Australian mfold Internet site only) 

When results are ready, an e mail message is sent containing the Web address of 
the results. The only file that is necessary for subsequent INVADER assay probe design 
analysis is the SS-Count file, which is then downloaded fi-om the Results page. An 
exemplary mfold analysis using a GenBank entry for Human Ubiquitin (#4506712) is 
shown below: 

SS-Count analysis for accessible sites identification. 

The SS-Count file is then imported into an Excel spreadsheet file and the 
following options are chosen: Data Type = Delimited <press Next>; Delimited = Select 
(x) Spaces; (x) Treat multiple delimiters as one <press Next>; Column for data format = 
General. Selecting these options results in the import into Excel of three columns of data 
(Figure 39). The first line in the first column is the total number of stable structures 
mfold found under the parameters used in the folding. With the Ubiquitin example, there 
were 12 structures found. 

The rest of the first column is the RNA nucleotide position numbered fi-om 1. 
The second column is the raw SS-Count number and represents the number of times the 
corresponding base was NOT base-paired as part of some secondary structure. The third 
column is the sequence of the RNA analyzed, identifying the base at each numbered 
position. By looking for bases that are involved in fewer structures {i.e., bases listed in 
colunm 3 corresponding to the higher numbers in column 2), it is possible to identify 
regions of the mRNA (identified by position number in column 1) that are more likely to 
be fi-ee of intra-strand base-pairing, and thus are more likely to be available for detection 
using the INVADER assay. 
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One way of viewing the data is to calculate the ranning average SS-count for a ten 
nucleotide stretch of the RNA (the Ave(lO) Index) and chart the Ave(lO) Index against 
the base pair position (See e.g.. Figure 48). 

An alternative plot is to graph the nucleotide position against the Ave(lO) Index 

5 expressed as a percentage of the total number of structures found by mfold (Fig 39). As 
with the raw SS-count table, regions of the RNA corresponding to higher numbers in the 
Ave(lO) Index are involved in fewer predicted folded structures. Viewing the Ave(lO) 
Index in either a chart or graph format reduces the complexity of the data, and can reveal 
longer stretches of the RNA that are more likely to be structure-free. For example, from 

10 the graph of the running average, a user can pick out all of the major peak areas as likely 
regions for INVADER assay probe design. This creates an SS-Count Candidate List. In 
the Human ubiquitin example, there is one major peak and about 6 other peaks (Fig. 39) 
The next step is to refer back to the raw {i.e., not the running average) SS-Count 
data from within each of the peak areas, and identify the residue where the running 

15 average is changing in magnitude, this is a local "turn" and is generally a good candidate 
residue to be positioned at the INVADER assay probe set cleavage site. For example, in 
Human Ubiquitin, residue Gl 19 is found at a minor local turn within a globally 
accessible region (Fig. 40). The INVADER assay probe set with the cleavage site at this 
location is a good performer in detection of this RNA, Placing the cleavage site at Gl 14 

20 did not result in better detection even though it had a higher %Ave(10) value. While not 
limiting the present invention to any particular mechanism, and an understanding of the 
mechanism is not necessary to practice the invention, this is likely to be because this area 
was less accessible to the Probe or INVADER oligonucleotides than was position Gl 19. 

OligoWalk Structure prediction with RNA Structure 3.5 for accessible sites 
25 identification 

In some embodiments, the program OligoWalk, a module of the software 
"RNAStructure" (Mathews et al, RNA 5:1458 [1999]) is used in the selection of sites 
that are more Ukely to be accessible for oligonucleotide binding, OligoWalk uses sets of 
thermodynamic parameters for both RNA and DNA, and their hybrids (AUawi and 
30 SantaLucia, Biochemistry 36:10581 [1997]; Mathews et al, J. Mol. Biol, 288:91 1 
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[1999]; Sugimoto et al. Biochemistry 34:1 121 1 [1995]) in an algorithm that relies on 
mfold for RNA secondary structure prediction (Zucker, Science 244:48 [1989]). 
OligoWalk is designed to predict the most favorable regions of an RNA target for 
designing antisense oligonucleotides by estimating the overall thermodynamics of 

5 hybridizing an antisense oUgomer to the RNA by taking into account the thermodynamics 
of destroying any structural motifs in the RNA target or the antisense oligonucleotide. 
The affinity of the oligomer to its target is expressed as an overall Gibbs free energy 
change of a self-structured oligomer, and of a target associating into an oUgomer-target 
complex. This free energy is usually a negative number, indicating favorable binding, 

10 and is expressed in 'kcal/mol' units. OhgoWalk analysis is performed with 8 to 1 5 base 
oUgonucleotide size to resemble the average length of the analyte specific region of the 
Signal Probe. Plotting the total binding energy against the length of the RNA generates a 
graph of peaks and valleys. The lowest negative values generally indicate the most 
favorable sites for ohgonucleotides to bind. The most inaccessible regions have positive 

1 5 binding energy values, and generally are a poor sites for assay probe design 

In a preferred embodiment, the OligoWalk module of RNA Structure 3.5 is used 
to determine binding energies by performing an 8-base OhgoWalk using the following 
settings: 

■ Break Local Structure 

20 ■ Include suboptimal structures 

■ Ohgo Length: 8nt 

■ Oligo Concentration: 100 nM 

■ Oligo Type: DNA 

■ Walk entire Target RNA 

25 

When these parameters have been set, the sequence file to be folded (the ".ct" 
output file from mfold) can be selected and opened. Once the sequence has been folded, 
a report can be created using the Output menu. The report is imported into Excel and the 
data generated above is plotted. In a preferred embodiment, the OligoWalk data is 
30 graphed with the SS-Count data. The regions displaying the lowest free energy values 
{i.e., the largest negative numbers) are generally the most likely to be accessible for 
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hybridization. In preferred embodiments, the 3' end and the majority of the target- 
binding region of the probe oligonucleotide complement an accessible region of the target 
RNA. In particularly preferred embodiments, the majority of the binding site for the 
corresponding INVADER oligonucleotide falls within the same accessible region. In 
another preferred embodiment, the binding site for an INVADER oligonucleotide falls 
within a nearby accessible region. 

An INVADER oligonucleotide can generally be positioned to bind to a less 
accessible site. While not limiting the present invention to any particular mechanism, it 
is observed that the INVADER oligonucleotides are generally longer than probe 
oligonucleotides used in the INVADER assay reactions and, because they are generally 
designed to remain bound to the target at the reaction temperature, they will be selected 
to have a T^s about 12 to 15 higher than that of a corresponding probe. Consequently, 
INVADER oligonucleotides may more readily break the local target structure, and thus 
may be less dependent on the accessibility of the target-binding site. 

In selecting among accessible sites for the design of INVADER assay 
oligonucleotides, the base composition of the site is also considered. It has been 
observed that stretched of more than 4 or 5 of the same nucleotide in a row (e.g., 
. . .AAAA. . . or . . .CCCC. . .) in any portion of the binding site for the assay 
oligonucleotides may reduce the performance of the probe set in the assay {e.g., by 
increasing background or decreasing specificity). Thus, in preferred embodiments, any 
stretches comprising four or more repeated bases are generally avoided. Another 
consideration is the effect of base composition on lengths of the oligonucleotides in the 
probe set. In many cases, targeting A-T rich sequences requires the use of longer 
oligonucleotides for a reaction performed at a given temperature, compared to the length 
of oligonucleotides targeted to sequences having a more even distribution of A-T and G- 
C bases. Longer oligonucleotides can be more prone to formation of intrastrand 
structures and dimer structures. Thus, it is preferred that the distribution or A-T bases 
and G-C bases within a target region be as close to even {i.e., about 50% G-C content) as 
the region to be detected permits. In particularly preferred embodiments, the distribution 
of A-T and G-C positions is evenly distributed across the binding sites {e.g., not having 
all A-T positions in one half, with all G-C positions in the other). 
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ii. Target site selection based on selectivity 

In some embodiments, probe sets are designed to examine highly homologous, or 
closely related RNA targets (i. e. , targets that are very similar in sequence). In such 
embodiments, the RNA or homologous cDNA sequences are compared, e.g., using an 
alignment program such as MEGALIGN (DNAstar Madison, WI). 

In some embodiments, selectivity is provided by designing probe sets to detect 
splice junctions. Sphce junctions can be identified by aligning the cDNA and gene 
sequences using an alignment program {e.g. MEGALIGN) or under the BLAST menu at 
the NCBI website (BLAST 2 sequences). Splice junctions are also often listed in the 
GenBank report (intron/exon sites). INVADER assay oligonucleotide sets are designed 
such that the probe and ESTVADER oligonucleotides are complementary to the coding 
strand (mRNA), generally with the cleavage site being as close to the sphce junction as 
possible. In some embodiments, different spUce junctions within an mRNA are analyzed 
for accessibility, as described above. In preferred embodiments, probe sets are designed 
to detect one or more sphce junctions showing greater accessibility compared to the 
accessibility of other splice junctions within the same RNA target. 

In some embodiments designed to exclude detection of RNAs related to the target 
RNA, sequences are examined to identify bases that are unique to the target RNA when 
compared to the other similar sequences from which the target is distinguished. 
Generally, the unique base is positioned to hybridize to the 5' end of the target-specific 
region of the probe oUgonucleotide. In some embodiments, two adjacent bases are 
unique to the target compared to the related RNA. If two adjacent unique bases are 
available in an appropriately accessible portion of the target RNA, it is preferred that 
these bases be used as the site around which the probe and INVADER oligonucleotides 
sets are designed. In some embodiments, the two unique bases are positioned such that 
the site of cleavage of the probe is between the two base-pairs they form with the probe. 
In other embodiments, one of the unique bases is in the last position of the hybridization 
site of the INVADER oligonucleotide (i.e., it is positioned to base-pair to the penultunate 
residue on the 3' end of the INVADER oligonucleotide). 
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In some embodiments, the assay is designed to include detection of RNAs that are 
similar, but not identical, to the target RNA. If the assay is being designed for inclusive 
detection, the compared sequences are examined to identify sites having complete 
homology. Such designs can be created to detect homologous sequences in the same 

5 species or between species. Generally, the most homologous regions are selected as 
hybridization targets for probe oligonucleotides. Generally, some variation can be 
tolerated, for example, if it is not at the base that would hybridize to the 5' end of the 
target-specific region of a probe. In some embodiments, variation is accommodated by 
the use of degenerate bases in the INVADER assay oligonucleotides (e.g., mixtures of 

10 bases are used at positions within thesynthesized probe, INVADER and/or stacker 
oligonucleotides, said mixtures selected to complement the mixture of specific bases 
present in the collection of related target RNAs). 

iii. Oligonucleotide design 

15 a. Target-specific regions: length and melting temperature 

As described above in Section I (a) concerning the ohgonucleotide design, in 
some embodiments, the length of the analyte-specific regions are defined by the 
temperature selected for running the reaction. Starting from the desired position (e.g., a 
variant position or splice junction in a target RNA, or a site corresponding to a low free 

20 energy value in an OligoWalk analysis) an iterative procedure is used by which the length 
of the ASR is increased by one base pair until a calculated optimal reaction temperature 
(Tm plus salt correction to compensate for enzyme and any other reaction conditions 
effects) matching the desired reaction temperature is reached. In general probes are 
selected to have an ASR with a calculated of about 60 °C if a stacking oligonucleotide 

25 is not used, and a of about 50 to 55 ''C if a stacking ohgonucleotide is used (a stacking 
oligonucleotide typically raises the Tm of a flanking probe ohgonucleotide by about 5 to 
15 ^^C). If the position of variation or a splice junction is a starting position, then the 
additions are made to the 3* end of the probe. Altematively, if the 3' end of the probe is 
to be positioned at the most accessible site, the additions are in the 5' direction. In some 

30 embodiments, wherein a stacker oligonucleotide is to be used, it is preferred that the 
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probe be designed to have a 3' base that has stable stacking interaction interface with the 
5' base of the stacker oligonucleotide. The stability of coaxial stacking is highly 
dependent on the identity of the stacking bases. Overall, the stability trend of coaxial 
stacking in decreasing order is purine:purine > purine :pyrimidine « pyrimidne:purine > 
pyrimidine:pyrimidine. In other embodiments employing a stacker, a less stable stacking 
interaction is preferred; in such cases the probe 3' base and/or the stacker 5' base are 
selected to provide a leass stable stacking interaction. In some embodiments, the probe 
3' base and/or the stacker 5' base are selected to have a mismatch with respect to the 
target strand, to reduce the strength of the stacking interaction. 

The same principles are also followed for INVADER oligonucleotide design. 
Briefly, starting from the position N, additional residues complementary to the target 
RNA starting from residue N-1 are then added in the upstream direction until the stability 
of the INVADER-target hybrid exceeds that of the probe (and therefore the planned assay 
reaction temperature). In preferred embodiments, the stability of the INVADER-target 
hybrid exceeds that of the probe by 12-15 °C. In general, INVADER oligonucleotides 
are selected to have a T^ near 75 °C. Software applications, such as 
INVADERCREATOR (Third Wave Technologies, Madison, WI) or Oligonucleotide 5.0 
may be used to assist in such calculations. 

If a stacking oligonucleotide is to be used, similar design principles are appUed. 
The stacking oligonucleotide is generally designed to hybridize at the site adjacent to the 
3' end of the probe oligonucleotide, such that the stacker/target helix formed can 
coaxially stack with the probe/target helix. The sequence is selected to have a calculated 
Tm of about 60 to 65 °C, with the calculation based on the use of natural bases. 
However, stacking oligonucleotides are generally synthesized using only 2'-0-methyl 
nucleotides, and consequently, have actual Tn,s that are higher than calculated by about 
0.8°C per base, for actual TmS close to 75 °C. 

In some embodiments, ARRESTOR oligonucleotides are included in a secondary 
reaction. ARRESTOR oligonucleotides are provided in a secondary reaction to sequester 
any remaining uncleaved probe from the primary reaction, to preclude interactions 
between the primary probe and the secondary target strand. ARRESTOR 
oligonucleotides are generally 2'-0-methylated, and comprise a portion that is 
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complementary to essentially all of their respective probe's target-specific region, and a 
portion that is complementary to at least a portion of the probe's flap regions {e.g., six 
nucleotides, counted from the +1 base towards the 5' end of the arm). 

b. Non-complementary regions 

5 Probe 5' Arm selection 

The non-complementary arm of the probe, if present, is preferably selected (by an 
iterative process as described above) to allow the secondary reaction to be performed at a 
particular reaction temperature. In the secondary reaction, the secondary probe is 
generally cycUng, and the cleaved 5' arm (serving as an INVADER oligonucleotide) 
1 0 should stably bind to the secondary target strand. 



INVADER oligonucleotide 3* terminal mismatch selection 

In preferred embodiments, the 3' base of the INVADER oligonucleotide is not 
complementary to the target strand, and is selected in the following order of preference 
1 5 (listed as INVADER oligonucleotide 3' base/target base): 
C in target: C/OA/OT/OG/C 
A in target: A/A > C/A > G/A > T/A 

Gin target: A/G > G/G > T/G > C/G 

U in target: C/U > A/U > T/U > G/U 

20 

c. Folding and dimer analysis 

In some embodiments, the oligonucleotides proposed for use in the INVADER 
assay are examined for possible inter- and intra-molecular structure formation in the 
absence of the target RNA. In general, it is desirable for assay probes to have fewer 
25 predicted inter- or intra molecular interactions. In some embodiments, the program 
OLIGO (e.g., OLIGO 5.0, Molecular Biology Insights, Inc., Cascade, CO) is used for 
such analysis. In other embodiments, the program mfold is used for the analysis. In yet 
other embodiments, the RNAStructure program can be used for dimer analysis. The 
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following sections provide stepwise instructions for the use of these programs for 
analysis of INVADER assay oligonucleotides. 

OLIGO 5.0 analysis for Probe structure and interaction prediction. 

5 Analysis of INVADER oligonucleotides using OLIGO 5 .0 comprises the 

following steps. All menu choices are shown in UPPER CASE type. 

1. Launch OLIGO 5.0 and open a sequence file for each mRNA to be analyzed. This 
is done by using a menu to select the following 

10 • Choose FILE- >NEW 

• Paste in longest available sequence 

• Choose ACCEPT & QUIT (F6) 

2. Set Program settings to default 

Choose FILE - >RESET- >ORIGINAL DEFAULTS 

15 3. Identify Probe Oligonucleotide 

• Select OLIGO LENGTH to be around 1 6 nucleotides (open the menu for this 
option by using ctrl-L keystrokes). 

• Move the cursor indicating the 5' end of the Current Oligo until the 3' end is 
located at the candidate cleavage site residue. 

20 • Choose ANALYSE - >DUPLEX FORMATION - >CURRENT OLIGO (ctrl-D) 

for a rough determination of the extent of dimer and hairpin formation. 

• Confirm length of analyte region corresponds with desired reaction temperature 
[e.g., through the use of Tm calculation as described in the Optimization of 
Reaction Conditions, I (c) of the Detailed Description of the Invention] 

25 • Select the "LOWER" button in OLIGO 5 .0 to copy the anti-sense sequence (this 

will be the analyte-specific region of the actual probe oligonucleotide and is anti- 
sense to the RNA strand.) 

• Import into a database file. 
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Save to computer memory. 



4. Identify INVADER Oligonucleotide 

• Choose sequence adjacent to the probe oUgonucleotide identified from step 3. 

• Select OLIGO LENGTH to -24 nucleotides 

• Confirm length of analyte region corresponds with desired reaction temperature 
[e.g., through the use of Tm calculation as described in the Optimization of 
Reaction Conditions, I (c) of the Detailed Description of the Invention, about 75 
°C for INVADER oligonucleotides). Select the "LOWER" button in OLIGO 5.0 
to copy the corresponding anti-sense sequence (this will be the analyte-specific 
region of the actual INVADER oUgonucleotide.) 

• Import into a database file. 

• Save to computer memory. 

5. Addition of Cleaved Arm Sequence and INVADER Oligonucleotide Mismatch 
Sequence. 

• Export the Probe oligonucleotide as Upper Primer. 

• Export the INVADER oligonucleotide as Lower Primer. 

• EDIT UPPER PRIMER to add in a candidate arm sequence (selected, for 
example, as described above). 

• Check that the arm sequence does not create new secondary structures (analysis 
performed as described above). 

• EDIT LOWER PRIMER to add in the 3' mismatched nucleotide that will overlap 
into the cleavage site (selected according to the guidelines for this mismatched 
bases, provided above). 

• Select all Upper and Lower Primer boxes in the "Print/Save Options" 

• PRINT ANALYSIS of Upper (Probe) and Lower (INVADER) oligonucleotides 
and check for lack of stable secondary structures. 

• Save both mRNA sequence and oligonucleotide sequence database files before 
quitting the program. 
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Generally, oligonucleotides having detected intra-molecular formations with 
stabilities of less than -6 AG are preferred. Less stable structures represent poor 
substrates for CLEAVASE enzymes, and thus cleavage of such structures is less likely to 
contribute to background signal. Probe and INVADER oligonucleotides having less 
affinity for each other are more available to bind to the target, ensuring the best cycling 
rates. 

The Tm of dimerized probes (i.e., probes wherein one probe molecule is 
hybridized to another probe molecule) should ideally be lower than the for the probe 
hybridized to the target, to ensure that the probes preferentially hybridize to the target 
sequence at the elevated temperatures at which INVADER assay reactions are generally 
conducted. Similarly, the for the INVADER oligonucleotide hybridized either to 
itself or to a probe molecule should be lower than the INVADER oligonucleotide/target 
Tm. It is preferred that dimer T^s (i.e., Probe/Probe and Probe/INVADER 
oligonucleotide) be 25°C or less to ensure that they will be unlikely to form at the 
planned reaction temperature. 

The melting temperatures for each of these complexes can be determined as 
described above in Optimization of Reaction Conditions, I (c) of the Detailed Description 
of the Invention, or by using the OLIGO software. Once RNAs sites and several 
candidate INVADER assay oligonucleotide sets are selected according to the process 
outlined above, the candidate oligonucleotide sets can be ranked according to the degree 
to which they comply with preferred selection rules, e.g., their location on the SS-Count 
average plot (peak, valley, neither), and the energetic predictions of probe and 
INVADER oligonucleotide interactions. In some embodiments, the ranked probe sets are 
tested in order of rank to identify one or more sets having suitable performance in an 
RNA INVADER assay. In other embodiments, several of the top ranked sets (e.g., two, 
three or more) are selected for testing, to rapidly identify one or more sets having suitable 
or desireable performance. 

Mfold analysis for probe structure and interaction prediction 

Analysis of probe and INVADER oligonucleotide interactions may be performed 
using mfold for DNA provided by Michael Zuker, available through Rensselaer 
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Polytechnic Institute at bioinfo.niath.rpi.edu/~mfoldydna/forml .cgi. The analysis is 
performed without changing the default ionic conditions, and with a selected temperature 
of 37 °C and with % suboptimaUty set to 75. Each sequence (e.g., probe, INVADER 
oligonucleotide, stacker, etc.) is folded using the program to check for any unimolecular 
structure formation (e.g., hairpins). The energies provided by mfold gives for 
unimolecular structures can be used as provided, without further calculations. 

Bimolecular structure formation for a given oligonucleotide is assessed by typing 
in the oligonucleotide sequence (5' to 3') followed by the sequence of a small, stable 
hairpin forming sequence (e.g., CCCCCTTTTGGGGG [SEQ ID NO:707]), followed by 
the same oUgonucleotide sequence, again listed 5' to 3. Constraints are entered to require 
that these Ts remain single-stranded and the strings of Cs and Gs in this spacer are 
basepaired. The command "F" is used to force basepairing, while the command "P" is 
used to prohibit basepairing, and the positions of the forced or prohibited basepairs are 
counted from the 5' end. For example, if the sequence of interest is a 20-mer, then the 
following is entered: 

F 21 0 5 [this forces the C's, C21 to C25, to base pair] 

P 26 0 4 [this forces the T's, T26 to T29, to be single stranded] 

F 30 0 5 [this forces the G's, G30 to G34, to base pair] 

On examination of the resulting structures, the stability of each can be estimated 
by subtracting the stability (i.e., the thermodynamic measures) of the central spacer 
hairpin from the total result (i.e.. Thermodynamics of possible structure = mfold structure 
thermodynamics - core hairpin thermodynamics). For convenience, in some 
embodiments, any nearest neighbor interactions between the central hairpin and dimers 
formed by the test sequence are ignored for this calculation; a more accurate analysis 
would require consideration of this interaction. The core hairpin formed by 
CCCCCTTTTGGGGG (SEQ ID NO:707) has the following thermodynamics: AG = -5.3 
AH = -37.8; AS = -104.8. 

The process can be demonstrated using the following probe sequence: 5'- 
CCCTATCTTTAAAGTTTTTAAAAAGTTTGA-3' (SEQ ID NO:70S). The 
oligonucleotide sequence is examined by mfold analysis for bimolecular structures using 
the following steps. 
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1- In mfold sequence box type: 

CCCTATCTTTAAAGTTTTTAAAAAGTTTGACCCCCTTTTGGGGGCCC 

TATCTTTAAAGTTTTTAAAAAGTTTGA (SEQ ID NO: 137) 



5 2- In the constraint box type: 
P3604 
F31 05 
F4005 

10 Results (showing one): 

structure 1 

dG = -14.2 dH = -150. S dS = -439.5 Tm = 69.3 

CCCTATCTTT |G G T 

Tyj-j^YIARPJ^ TTTGA CCCCC T 
15 TTT AAAAATTTTT AAATT GGGGG T 
AG *G G TCTATCCC T 

To evaluate the stability of the duplex: 

CCCTATCTTT | G G 
20 AAA TTTTTAAAAA TTTGA 

TTT AAAAATTTTT AAATT 
AG *G G TCTATCCC 

the thermodyanamic values for the hairpin alone are subtracted from the values for the 
25 complete structure: 

AG = -14.2 -(-5.3) --8.9, 
AH = - 150.5 -(-37.8) = -112.7, 
AS = -439.5 - (-104.8) = -334.7, 
Using a calculation wherein (°C) = {AH / [AS + R In (CT/4)]} - 273.15, wherein R is 
30 the gas constant 1 .987 (cal/K.mol), In is the natural log, and CT is the total single strand 
concentration in Molar, this results in a calculated of 46.1 °C for the non-hairpin 
portion of the structure. 

The above method is not limited to the use of the core hairpin sequence 
CCCCCTTTTGGGGG but rather any stable hairpin sequences can be used. For 
35 example, CGCGCGGAACGCGCG (SEQ ID NO: 138) or CCCGGGTTTTCCCGGG 
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(SEQ ID NO: 139). However, if a different hairpin sequence is used, one needs to 
calculate its stability using mfold and use its thermodynamics in the subsequent 
calculations. 

5 RNAStructure for oligonucleotide interaction prediction 

Dimer formation can also be evaluated using the RNAStructure program. Unlike 
mfold, RNAStructure allows the calculation of all possible oligonucleotide- 
ohgonucleotide interactions and provides an output .ct file. One can then view the 
structures using any .ct viewing program such as RNAStructure or RNAvis (1997, P. 

10 Rijk, University of Antwerp (UIA), available on the Internet at rma.uia.ac.be/mavis) and 
evaluate the stability of any dimer formation using the nearest-neighbor model (Borer et 
al., 1974) and DNA nearest-neighbor parameters (Allawi & SantaLucia, 1997). 

For example, to evaluate the propensity of the sequence 5' 
AGGCGCACCAATTTGGTGTT 3' (SEQ ID NO: 140) for dimer formation using the 

1 5 DNA Fold Intermolecular module of RNAStmcture, the sequence is saved into a file 
{e.g., probe.seq) and the following parameters are set: 
Sequence file 1 : probe.seq 
Sequence file 2: probe.seq 
CT file: dimer.ct 

20 Max % Energy difference: 50 
Max number of stmctures: 20 
Window size: do not change 

After the calculation is done, one can view the resulting .ct file using the "view" module 
25 of RNAStmcture. Generally, there will be several stmctures within the .ct file. The view 
module is used to view them individually. One of the dimers that the test sequece, above, 
can form according to RNAStmcture is: 
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AGGCG TT 
CACCAATTTGGTG 
GTGGTTTAACCAC 
TT GCGGA 

5 

According to the nearest-neighbor model (Le,, using DNA nearest-neighbor and 
mismatch parameters [Allawi & SantaLucia, 1997]), the stability of this duplex in IM 
NaCl and at a probe concentration of 100 [xM is: 
AG%7 = -10.07 
10 AH=-87.6 
AS=-250.1 
Tm=50.rc 

By changing the identities of Sequence Files 1 & 2, RNAStructure can be used to 
15 evaluate the possibility of any dimer formation between pairs of all of the DNA 
oligonucleotides present in an INVADER assay reaction. 

iv. Assay performance evaluation 

Probe sets selected according to the guidelines provided above can be tested in the 
20 INVADER assay to evaluate performance. While the oligonucleotides are designed to 
perform at or near a particular desired reaction temperature, the best performance for a 
given design may not be precisely at the intended temperature. Thus, in evaluating any 
new INVADER assay probe set, it can be helpful to examine the performance in the 
INVADER assay conducted at several different reaction temperatures, over a range of 
25 about 10 to 15 ^C, centered around the designed temperature. For convenience, 

temperature optimization can be performed on a temperature gradient thermocycler with 
a fixed amount of RNA (e.g., 2,5 amoles of an in vitro transcript per reaction),- and for a 
fixed amount of time (e.g., 1 hour each for Primary and Secondary reactions). The 
temperature gradient test will reveal the temperature at which the designed probe set 
30 produces the best performance (e,g,, the highest level of target-specific signal compared 
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to background signal, generally expressed as a multiple of the zero-target background 
signal, or "fold over zero"). 

The results can be examined to see how close the measured temperature optimum 
is to the intended temperature of operation. In some embodiments, it is desirable to have 
probe sets that operate at or near a pre-selected temperature. If the measured temperature 
optimum is higher than the desired reaction temperature, a probe design can be altered in 
ways that tend to reduce the probe/target Tm (e.g., shortened by one or more bases, or 
altered to contain one or more mismatched bases). In some embodiments, wherein a 
stacker oligonucleotide is not used, wherein the reaction temperature is more than 7 °C 
above the desired reaction temperature, and wherein the performance (e.g., the fold over 
zero) is acceptable, use of a 3' mismatch on the probe oligonucleotide is likely to lower 
the reaction temperature without otherwise altering the assay performance. 

An LOD determination can be made by performing reactions on varying amounts 
of target RNA (e.g., an in vitro transcript control RNA of known concentration). In 
preferred embodiments, a designed assay has an LOD of less than 0.05 attomole. In 
particularly preferred embodiments, a designed assay has an LOD of less than 0.01 
attomole. It is contemplated that the same guideline provided above for reducing the 
LOD of a designed assay may be used for the purpose of raising the LOD of a designed 
assay, i.e., to make it LESS sensitive to the presence of a target RNA. For example, it 
may be desirable to detect an abundant RNA and a rare RNA in the same reaction. In 
such a reaction, it may be desirable to attenuate the signal generated for the abundant 
RNA so that it does not overwhelm the signal from the rarer species. In some 
embodiments this may be done by designing probe sets for reduced signal generation, 
e.g., an LOD of at least (not less than) 0.5 attomoles. In some embodiments, a single step 
INVADER assay may be used for detection of abundant targets in a sample, while 
sequential INVADER reactions to amplify signal, as described in Section II, may be used 
for less abundant analytes in the same sample. In preferred embodiments, the single step 
and the sequential INVADER assay reactions for the different analytes are performed in a 
single reaction. 

In some embodiments, time course reactions are run, wherein the accumulation of 
signal for a known amount of target is measured for reactions run for different lengths of 



160 



time. This measurement will establish the linear ranges, i.e,, the ranges in which accurate 
quantitative measurements can be made using a given assay design, with respect to time 
and starting target RNA level. 

5 V. Design and assay optimization 

Some designed assays may not meet the preferred performance criteria described 
above. A number of variations on the performance of INVADER assay reactions have 
been described herein. In optimizing performance of the INVADER assay for the 
detection of RNA targets, these variations may be used alone or in combination. For 

10 example, in some embodiments, a stacker oligonucleotide is employed. While not 
limiting the present invention to any particular mechanism of action, in some 
embodiments, a stacker oligonucleotide may enhance performance of an assay by altering 
the hybridization characteristics {e.g., Tm) of a probe or an INVADER oligonucleotide. 
In some embodiments, a stacker oligonucleotide may increase performance by enabling 

1 5 the use of a shorter probe. In other embodiments, a stacker oligonucleotide may enhance 
performance by altering the folded structure of the target nucleic acid. In yet other 
embodiments, the enhancing activity of the stacker oligonucleotide may involve these 
and other mechanisms in combination. 

In other embodiments, the target site may be shifted. In some embodiments, 

20 reactions are optimized by testing multiple probe sets that shift along a suspected 

accessible site* In preferred embodiments, such probe sets shift along the accessible site 
in one to two base increments. In embodiments wherein accessible sites have previously 
been predicted only by computer analysis, physical detection of the accessible sites may 
be employed to optimize a probe set design. In preferred embodiments, the RT-ROL 

25 method of detecting accessible sites is employed. In some embodiments, optimization of 
a probe set design may require shifting of the target site to a newly identified accessible 
site. 

In some embodiments, e.g., wherein an accessible site has been identified yet 
probe set performance is low, a change in the design of a probe 5' arm may improve 
30 assay performance without altering the site targeted. In other embodiments, altering the 
length of an ARRESTOR oligonucleotide {e,g,, increasing the length of the portion that is 
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complementary to the 5' arm region of the probe) may reduce background signal, thus 
increasing the probe stet performance. 

Other variations on oligonucleotide design may be employed to alter performance 
in an assay. Some modifications may be employed to shift the ideal operating 
temperature of a probe set design into a preferred temperature range. For example, the 
use of shorter oligonucleotides and the incorporation of mismatches generally act to 
reduce the TmS, and thus reduce the ideal operating temperatures, of designed 
ohgonucleotides. Conversely, the use of longer oligonucleotides and the employment of 
stacking oligonucleotides generally act to increase the T^s, and thus increase the ideal 
operating temperatures of the designed oligonucleotides. 

Other modifications may be employed to alter other aspects of oligonucleotide 
performance in an assay. For example, the use of base analogs or modified bases can 
alter enzyme recognition of the oUgonucleotide. In some embodiments, such modified 
bases are used to protect a region of an oligonucleotide from nuclease cleavage. In other 
embodiments, modified bases are used to affect the ability of an oligonucleotide to 
participate as a member of a cleavage structure that is not in a position to be cleaved (e,g.^ 
to serve as an INVADER oligonucleotide to enable cleavage of a probe). These modified 
bases may be referred to as "blocker" or "blocking" modifications. In some 
embodiments, assay oligonucleotides incorporate 2'-0-methyl modifications. In other 
embodiments, assay oligonucleotides incorporate 3' terminal modifications (e.g., NH2, 3' 
hexanol, 3' phosphate, 3* biotin). 

In yet other embodiments, the components of the reaction may be altered to affect 
assay performance. For example, oligonucleotide concentrations may be varied. 
Oligonucleotide concentrations can affect multiple aspects of the reaction. Since melting 
temperatures of complexes are partly a function of the concentrations of the components 
of the complex, variation of the concentrations of the oligonucleotide components can be 
used as one facet of reaction optimization. In the methods of the present invention, 
ARRESTOR oligonucleotides may be used to modulate the availability of the primary 
probe oligonucleotides in an INVADER assay reaction. In some embodiments, an 
ARRESTOR oligonucleotide may be excluded. Other reaction components may also be 
varied, including enzyme concentration, salt and divalent ion concentration and identity. 
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VL Kits for performing the RNA INVADER Assay 



In some embodiments, the present invention provides kits comprising one or more 
5 of the components necessary for practicing the present invention. For example, the 
present invention provides kits for storing or deHvering the enzymes of the present 
invention and/or the reaction components necessary to practice a cleavage assay (e.g., the 
INVADER assay). The kit may include any and all components necessary or desired for 
the enzymes or assays including, but not limited to, the reagents themselves, buffers, 

10 control reagents (e.g., tissue samples, positive and negative control target 

oligonucleotides, etc.), solid supports, labels, written and/or pictorial instructions and 
product information, inhibitors, labeling and/or detection reagents, package 
environmental controls (e.g., ice, desiccants, etc.), and the like. In some embodiments, 
the kits provide a sub-set of the required components, wherein it is expected that the user 

15 will supply the remaining components. In some embodiments, the kits comprise two or 
more separate containers wherein each container houses a subset of the components to be 
dehvered. For example, a first container (e.g., box) may contain an enzyme (e.g., 
structure specific cleavage enzyme in a suitable storage buffer and container), while a 
second box may contain oUgonucleotides (e.g., INVADER oligonucleotides, probe 

20 oligonucleotides, control target oligonucleotides, etc.). In some embodiments one or 
more the reaction components may be provided in a predispensed format (i.e., 
premeasured for use in a step of the procedure without re-measurement or re-dispensing). 
In some embodiments, selected reaction components are mixed and predispensed 
together. In preferred embodiments, predispensed reaction components are predispensed 

25 and are provided in a reaction vessel (including but not limited to a reaction tube or a 
well, as in, e,g., a microtiter plate). In particularly preferred embodiments, predispensed 
reaction components are dried down {e.g., desiccated or lyophilized) in a reaction vessel. 

Additionally, in some embodiments, the present invention provides methods of 
deUvering kits or reagents to customers for use in the methods of the present invention. 

30 The methods of the present invention are not limited to a particular group of customers. 
Indeed, the methods of the present invention find use in the providing of kits or reagents 

163 



to customers in many sectors of the biological and medical community, including, but not 
limited to customers in academic research labs, customers in the biotechnology and 
medical industries, and customers in governmental labs. The methods of the present 
invention provide for all aspects of providing the kits or reagents to the customers, 

5 including, but not limited to, marketing, sales, delivery, and technical support. 

In some embodiments of the present invention, quality control (QC) and/or 
quality assurance (QA) experiments are conducted prior to delivery of the kits or reagents 
to customers. Such QC and QA techniques typically involve testing the reagents in 
experiments similar to the intended commercial uses (e.g., using assays similar to those 

1 0 described herein). Testing may include experiments to determine shelf life of products 
and their ability to withstand a wide range of solution and/or reaction conditions (e.g. , 
temperature, pH, light, etc.). 

In some embodiments of the present invention, the compositions and/or methods 
of the present invention are disclosed and/or demonstrated to customers prior to sale {e.g., 

1 5 through printed or web-based advertising, demonstrations, etc.) indicating the use or 

functionality of the present invention or components of the present invention. However, 
in some embodiments, customers are not informed of the presence or use of one or more 
components in the product being sold. In such embodiments, sales are developed, for 
example, through the improved and/or desired function of the product (e.g., kit) rather 

20 than through knowledge of why or how it works (i.e., the user need not know the 
components of kits or reaction mixtures). Thus, the present invention contemplates 
making kits, reagents, or assays available to users, whether or not the user has knowledge 
of the components or workings of the system. 

Accordingly, in some embodiments, sales and marketing efforts present 

25 information about the novel and/or improved properties of the methods and compositions 
of the present invention. In other embodiments, such mechanistic information is withheld 
from marketing materials. In some embodiments, customers are surveyed to obtain 
information about the type of assay components or delivery systems that most suits their 
needs. Such information is useful in the design of the components of the kit and the 

30 design of marketing efforts. 
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VII. The INVADER Assay for Direct Detection and Measurement of Specific 
RNA Anaiytes. 

The following section provides a few illustrative examples of mRNAs that may be 
detected or measured using the methods, compositions and systems of the present 
invention. 

HOUSEKEEPING CONTROLS 

RNAs that are generally present in predicable or invariant amounts in test samples 
provide useful control targets for detection assays. These controls can be useful in 
several ways, including but not limited to providing confirmation of the proper function 
of an assay, and as a standard against which a test result for another RNA can be 
compared or measured to aid in interpretation of a result. mRNAs for the following 
genes find particular use in the methods of the present invention. 
Human Ubiquitin and Mouse/Rat Ubiquitin 

The ubiquitin system is a major pathway for selective protein degradation. 
Degradation by this system is instrumental in a variety of cellular functions such as DNA 
repair, cell cycle progression, signal transduction, transcription, and antigen presentation. 
The ubiquitin pathway also eliminates proteins that are misfolded, misplaced, or that are 
in other ways abnormal. This pathway requires the covalent attachment of ubiquitin 
(El), a highly conserved 76 amino acid protein, to defined lysine residues of substrate 
proteins. 

Human, rat and mouse gIyceraldehyde-3-phosphate dehydrogenase (GAPDH) 

GAPDH is an important enzyme in the glycolysis and gluconeogenesis pathways. 
This homotetrameric enzyme catalyzes the oxidative phosphorylation of D- 
glyceraldehyde-3-phosphate to 1,3-diphosphoglycerate in the presence of cofactor and 
inorganic phosphate. A variety of diverse biological properties of GAPDH have been 
reported. These include functions in endocytosis, mRNA regulation, tRNA export, DNA 
replication, DNA repair, and neuronal apoptosis. 
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CYTOKINES 

A growing family of regulatory proteins that deliver signals between cells of the 
immune system has been identified. Called c3^okines, these proteins have been found to 
control the growth and development, and bioactivities, of cells of the hematopoietic and 
5 immune systems. Cytokines exhibit a wide range of biological activities with target cells 
from bone marrow, peripheral blood, fetal liver, and other }3aiiphoid or hematopoietic 
organs. The present invention describes methods for the detection of expression of 
cytokines, including but not limited to of the exemplary members of the cytokine family 
listed below. 

10 

J Human Oncostatin M 

^ Oncostatin M is a secreted single-chain polypeptide cytokine that regulates the 

..p growth of certain tumor-derived and normal cell lines. A number of cell types have been 

til found to bind the oncostatin M protein. While it has been shown to inhibit proliferation 

ff ^ 15 of a number of tumor cell types, it has also been implicated in stimulating proliferation of 
Q Kaposi's sarcoma cells. 

J Human transforming growth factor-beta (TGF-p) 

Transforming growth factor-beta (TGF-beta) is a member of a family of 
20 structurally-related cytokines that elicit a variety of responses, including growth, 

differentiation, and morphogenesis, in many different cell types. In vertebrates, at least 
five different forms of TGF-beta, termed TGF-betal to TGF-beta5, have been identified; 
they all share a high degree (60%-80%) of amino-acid sequence identity. While TGF- 
betal was initially characterized by its ability to induce anchorage-independent growth of 
25 normal rat kidney cells, its effects on most cell types are anti-mitogenic. It is strongly 
growth-inhibitory for many types of cells, including both normal and transformed 
epithelial, endothelial, fibroblast, neuronal, lymphoid, and hematopoietic cells. In 
addition, TGF-beta plays a central role in regulating the formation of extracellular matrix 
and cell-matrix adhesion processes. 

30 
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Human monocyte chemoattractant protein-1 (MCP-1) 

Within this family of cytokines, an emerging group of chemotactic cytokines, also 
called chemokines or intercrines, has been identified. Two subfamilies of chemokines 
have been recognized, alpha and beta, based on chromosomal location and the 
5 arrangement of the cysteine residues. 

The human genes encoding the beta subfamily proteins are located on 
chromosome 17 (their mouse counterparts are clustered on mouse chromosome 11, which 
is the counterpart of human chromosome 17), Homology in the beta subfamily ranges 
from 28-45% intraspecies, from 25-55% interspecies. An exemplary member is the 

10 human protein MCP-1 (monocyte chemoattractant protein-l). MCP-1 exerts several 

effects specifically on monocytes. It is a potent chemoattractant for human monocytes in 
vitro and can stimulate an increase in cytosolic free calcium and the respiratory burst in 
monocytes, MCP-1 has been reported to activate monoc>le-mediated tumoristatic 
activity, as well as to induce tumoricidal activity. MCP-1 has been implicated as an 

1 5 important factor in mediating monocytic infiltration of tissues inflammatory processes 
such as rheumatoid arthritis and alveolitis. The factor may also play a fundamental role 
in the recruitment of monocyte-macrophages into developing atherosclerotic lesions. 

Human tumor necrosis factor alpha (TNF-a) 

20 Tumor necrosis factor alpha (TNF-alpha also cachectin) is an important cytokine 

that plays a role in host defense. The cytokine is produced primarily in macrophages and 
monocytes in response to infection, invasion, injury, or inflammation. Some examples of 
inducers of TNF-alpha include bacterial endotoxins, bacteria, viruses, lipopolysaccharide 
(LPS) and cytokines including GM-CSF, IL-1, IL-2 and IFN-gamma. 

25 Despite the protective effects of the c34okine, overexpression of TNF-alpha often 

results in disease states, particularly in infectious, inflammatory and autoimmune 
diseases. This process may involve the apoptotic pathways. High levels of plasma TNF- 
alpha have been found in infectious diseases such as sepsis syndrome, bacterial 
meningitis, cerebral malaria, and AIDS; autoimmune diseases such as rheumatoid 

30 arthritis, inflammatory bowel disease (including Crohn's disease), sarcoidosis, multiple 
sclerosis, Kawasaki syndrome, graft-versus-host disease and transplant (allograft) 
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rejection; and organ failure conditions such as adult respiratory distress syndrome, 
congestive heart failure, acute liver failure and myocardial infarction. Other diseases in 
which TNF-alpha is involved include asthma, brain injury following ischemia, non- 
insulin-dependent diabetes mellitus, insulin-dependent diabetes mellitus, hepatitis, atopic 
dermatitis, and pancreatitis. Further, inhibitors of TNF-alpha have been suggested to be 
useful for cancer prevention. Elevated TNF-alpha expression may also play a role in 
obesity. TNF-alpha was found to be expressed in human adipocytes and increased 
expression, in general, correlated with obesity. 

Human interleukin-6 (IL-6) 

IL-6 is the standardized name of a cytokine called B lymphocyte differentiating 
factor, interferon beta2, 26 Kd protein, hybridomayplasmacytoma growth factor, 
hepatocyte stimulating factor, etc. 

IL-6 induces activated B cells to be differentiated into antibody forming cells. 
For T cells, IL-6 induces T cells stimulated by mitogens to produce IL-2 and induces the 
expression of IL-2 receptor on a certain T cell Une or thymocytes. For blood forming 
cells, IL-6 induces the growth of blood forming stem cells synergistically in the presence 
of IL-3. Fxuthermore, recently, it was reported that IL-6 acted like thrombopoietin. 

IL-6 is produced by various cells. It is produced by lymphocytes and is also 
produced by human fibroblasts stimulated by Poly (I)-Poly (C) and cycloheximide. 
Murine IL-6 is produced in mouse cells, which are stimulated by Poly (A)-Poly (U). 
Inducers for stimulation are diverse, and include known cytokines such as IL-1, TNF and 
IFN-beta, growth factors such as PDGF and TGF-beta, LPS, PMA, PHA, cholera toxin, 
etc. Moreover, it is reported that human vascular endotheUal cells, macrophages, human 
glioblastomas, etc. also produce IL-6. Furthermore, it is also known that the productivity 
can be further enhanced by stimulating cells using an inducer and subsequently treating 
the cells by a metabolic inhibitor such as verapamil, cycloheximide or actinomycin D, 
etc. 
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Human interleukin Ibeta (IL-IP) 

Interleukin-l (IL-1) is important to the activation of T and B lymphocytes and 
mediates many inflammatory processes. Two distinct forms of IL-1 have been isolated 
and expressed; termed IL-lbeta and IL-1 alpha. IL-lbeta is the predominant form 
produced by human monocytes both at the mRNA and protein level. The two forms of 
human IL-1 share only 26% amino acid homology. Despite their distinct polypeptide 
sequences, the two forms of IL-1 have structural similarities, in that the amino acid 
homology is confined to discrete regions of the IL-1 molecule. The two forms of IL-1 
also possess identical biological properties, including induction of fever, slow wave 
sleep, and neutrophiha, T- and B-lymphocyte activation, fibroblast proliferation, 
cytotoxicity for certain cells, induction of coUagenases, synthesis of hepatic acute phase 
proteins, and increased production of colony stimulating factors and collagen. IL-1 also 
activates endotheUal cells, resulting in increased leukocyte adhesiveness, PGI2 and PGE2 
(prostaglandins) release, and synthesis of platelet activating factor, procoagulant activity, 
and a plasminogen activator inhibitor. Clearly, IL-1 plays a central role in local and 
systemic host responses. Because many of the biological effects of IL-1 are produced at 
picomolar (pg) concentrations in vivo, IL-1 production is likely a fundamental 
characteristic of host defense mechanisms. 

Human interleukin 2 (IL-2) 

Interleukin-2 (IL-2) is the main growth factor of T lymphocytes. By regulating T 
helper lymphocyte activity IL-2 increases the humoral and cellular inmiune responses. 
By stimulating cytotoxic CDS T cells and NK cells, this cytokine participates in the 
defense mechanisms against tumors and viral infections. IL-2 is used in therapy agamst 
metastatic melanoma and renal adenocarcinoma. IL-2 is used in clinical trials in many 
forms of cancer. It is also used in HIV infected patients and leads to a significant 
increase in CD4 counts. Human IL-2 is a protein of 133 amino acids (aa) composed of 
four alpha helices connected by loops of various length, its tridimensional structure has 
been estabhshed. IL-2R is composed of three chains alpha, beta and gamma. IL2Ralpha 
controls the affinity of the receptor IL-2Rbeta and IL-2Rgamma are responsible for IL-2 
signal transduction. The different molecular areas of IL-2 interacting with the three 
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chains of the IL-2 R have been defined. More specifically it has been determined that a 
helix A as well as the NH2 terminal area of IL-2 (residues 1 to 30) control the interactions 
IL-2/IL-2Rbeta. 

5 Human interleukin 8 (IL-8) 

Human IL-8 is a cytokine that has variously been called neutrophil-activating 
protein, neutrophil chemotactic factor (NCF) and T-cell chemotactic factor. IL-8 can be 
secreted by several types of cells upon appropriate stimulation, IL-8 is secreted by 
activated monoc3^es and macrophages as well as by embryonic fibroblasts. 

10 IL-8 is known to induce neutrophil migration and to activate functions of 

neutrophils such as degranulation, release of superoxide anion and adhesion to the 
endothehal cell monolayer. There are a number of conditions that are known to involve 
leukocyte infiltration into lesions. These include pulmonary diseases such as pulmonary 
cystic fibrosis, idiopathic pulmonary fibrosis, adult respiratory distress syndrome, 

15 sarcoidosis and empyema; dermal diseases such as psoriasis, rheumatoid arthritis; and 
inflammatory bowel disease (Crohn's Disease), 

The amino acid sequence characterizing human IL-8 was described by 
Matsushima, et al. in PCT appUcation WO89/08665. More recently, it was shown that 
monocjle-derived IL-8 was evidently variably processed at the N-terminus and that the 

20 IL-8 originally disclosed by Matsushima et al. was accompanied by two forms of the 
factor which had seven or five additional amino acids at the N-terminus (Yoshimura, et 
al, Mol Immunol 26:87 [1989]). The longest form accounted for about 8%, the next 
longest form for about 47%, and the shortest form for about 45% of the total IL-8 derived 
fi*om monocytes. 

25 

Human interleukin 10 (IL-10) 

Interleukin- 10 (IL-10), a recently discovered lymphokine, was originally 
described as an inhibitor of interferon-gamma synthesis and is postulated as a major 
mediator of the humoral class of immune response. Two classes of often mutually 
30 exclusive immune responses are the humoral (antibody-mediated) and the delayed-type 
hypersensitivity. 



170 



It is postulated that these two differing immune responses may arise from two 
types of helper T-cell clones, namely Thl and Th2 helper T-cells, which demonstrate 
distinct cytokine secretion patterns. Mouse Thl cell clones secrete interferon-gamma, 
and IL-2 and preferentially induce the delayed-type hypersensitivity response while Th-2 
5 cell clones secrete IL-4, IL-5 and IL-10 and provide support for the humoral responses. 
The contrast in immune response could result because interferon-gamma secreted by the 
Thl cell clones inhibits Th2 clone proliferation in vitro, while IL-IO secreted by the Th2 
cell clones inhibits cytokine secretion by the Thl cell clones. Thus the two T-helper cell 
types may be mutually inhibitory and may provide the underpinning for the two 
1 0 dissimilar immune responses. 

IL-10 has been cloned and sequenced from both murine and human T cells. Both 
sequences contain an open reading frame encoding a polypeptide of 178 amino acids with 
an N-terminal hydrophobic leader sequence of 18 amino acids, and have an amino acid 
sequence homology of 73%. 

15 

Human interleukin 4 (IL-4) 

Interleukin-4 (IL-4, also known as B cell stimulating factor, or BSF-1) was 
originally characterized by its ability to stimulate the proliferation of B cells in response 
to low concentrations of antibodies directed to surface immunoglobulin. More recently, 

20 IL-4 has been shown to possess a far broader spectrum of biological activities, including 
growth co-stimulation of T cells, mast cells, granulocytes, megakaryocytes, and 
erythrocytes. In addition, IL-4 stimulates the proliferation of several IL-2- and IL-3- 
dependent cell lines, induces the expression of class II major histocompatibility complex 
molecules on resting B cells, and enhances the secretion of IgE and IgGl isotypes by 

25 stimulated B cells. Both murine and human IL-4 have been definitively characterized by 
recombinant DNA technology and by purification to homogeneity of the natural murine 
protein. 

The biological activities of IL-4 are mediated by specific cell surface receptors for 
IL-4 that are expressed on primary cells and in vitro cell lines of mammalian origin. IL-4 
30 binds to the receptor, which then transduces a biological signal to various inomune 
effector cells. 
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Human interferon gamma (IFN-y) 

Like the interleukinSj interferons belong to the class of the cytokines and are listed 
in various classes: interferon-alpha, interferon-beta, interferon-ganima, interferon-omega 
5 and interferon-tau. Interferon-ganima is a glycoprotein, the amino acid sequence of 
which has been known since 1982. In the mature condition the interferon-gamma has 
143 amino acids and a molecular weight of 63 to 73 kilodaltons. 

The tertiary and quaternary structure of the non-glycosylised protein was clarified 
in 199 L According to this, interferon-gamma exists as a homodimer, the monomers 

10 being orientated in contrary directions in such a way that the C-terminal end of one 

monomer is located in the vicinity of the N-terminal end of the other monomer. Each of 
these monomers in all has six alpha-hehces. Interferon-gamma is also called 
immunointerferon, as it has non-specific antiviral, antiproliferative and in particular 
immunomodulatory effects. Its production in T-helper-lymphocytes is stimulated by 

15 mitogens and antigens. The effect of the expressed interferon-ganmia has not yet been 
precisely clarified, but is subject to intensive research. In particular, interferon-gamma 
leads to the activation of macrophages and to the synthesis of histocompatability antigens 
of the class 2. In vitro, the activity of interferon-gamma is normally determined as a 
reduction in the virus-induced cytopathic effect, which arises from treatment with 

20 interferon-gamma. Due to its antigen-non-specific antiviral, antiproliferative and 

immunomodulatory activity it is suitable as a human therapeutic agent, for example of 
kidney tumours and chronic granulomatosis. 

CYTOCHROME P450s 

25 The term cytochrome P-450 refers to a family of enzymes (located on the 

endoplasmic reticulum, with high concentrations of the proteins in the cells of the liver 
and small intestine) that are primarily responsible for the metabolism of xenobiotics such 
as drugs, carcinogens and environmental chemicals, as well as several classes of 
endobiotics such as steroids and prostaglandins. Members of the cytochrome P450 

30 family are present in varying levels and their expression and activities are controlled by 
variables such as chemical environment, sex, developmental stage, nutrition and age. 
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More than 200 cytochrome P450 genes have been identified. There are multiple 
forms of these P450 genes and each of the individual forms exhibit degrees of specificity 
towards individual chemicals in the above classes of compounds. In some cases, a 
substrate, whether it be drug or carcinogen, is metabolized by more then one of the 
5 cytochromes P450, Genetic polymorphisms of cytochromes P450 result in 
phenotypically-distinct subpopulations that differ in their ability to perform 
biotransformations of particular drugs and other chemical compounds. 

The present invention provides methods for the detection cytochrome P450 
mRNAs, including but not limited to, Human CYP 1 Al, Human CYP 1 A2, Human CYP 
10 2B1, Human CYP 2B2, Human CYP 2B6, Human CYP 2C19, Human CYP 2C9, Human 
CYP 2D6, Human CYP 3A4, Human CYP 3A5, Human CYP 3A7, Rat CYP 2E1, Rat 
CYP 3A1, Rat CYP 3A2, Rat CYP 4A1, Rat CYP 4A2 and Rat CYP 4A3. 
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EXAMPLES 



The following examples serve to illustrate certain preferred embodiments and 
aspects of the present invention and are not to be construed as limiting the scope thereof. 
5 In the disclosure which follows, the following abbreviations apply: Afu 

{Archaeoglobus fulgidus); Mth (Methanobacterium thermoautotrophicum); Mja 
(Methanococcus jannaschii); Ffa {Pyrococcus furiosus)\ Pwo (Pyrococcus woesei)\ Taq 
{Thermus aquaticus); Taq DNAP, DNAPra^, and Taq Pol I (T aquaticus DNA 
polymerase I); DNAPStf (the Stoffel fragment of DNAPJa^); DNAPEcl {E. coli DNA 

10 polymerase I); Tth {Thermus thermophilus); Ex, (Example); Fig. (Figure);'^C (degrees 
Centigrade); g (gravitational field); hr (hour); min (minute); olio (oligonucleotide); rxn 
(reaction); vol (volume); w/v (weight to volume); v/v (volume to volume); BSA (bovine 
serum albumin); CTAB (cetyltrimethylammonium bromide); HPLC (high pressure liquid 
chromatography); DNA (deoxyribonucleic acid); p (plasmid); \il (microliters); ml 

15 (milliliters); pig (micrograms); mg (milligrams); M (molar); mM (miliiMolar); jaM 
(microMolar); pmoles (picomoles); amoles (attomoles); zmoles (zeptomoles); 
nm (nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethylene diamine 
tetra-acetic acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl sulfate); 
NaP04 (sodium phosphate); NP-40 (Nonidet P-40); Tris (tris(hydroxymethyl)- 

20 aminomethane); PMSF (phenylmethylsulfonylfluoride); TBE (Tris-Borate-EDTA, i.e,, 
Tris buffer titrated with boric acid rather than HCl and containing EDTA); 
PBS (phosphate buffered saline); PPBS (phosphate buffered saline containing 1 mM 
PMSF); PAGE (polyacrylamide gel electrophoresis); Tween (polyoxyethylene-sorbitan); 
ATCC (American Type Culture Collection, Rockville, MD); Coriell (Coriell Cell 

25 Repositories, Camden, NJ); DSMZ (Deutsche Sammlung von Mikroorganismen und 

Zellculturen, Braunschweig, Germany); Ambion (Ambion, Inc., Austin, TX); Boehringer 
(Boehringer Mannheim Biochemical, Indianapolis, IN); MJ Research (MJ Research, 
Watertown, MA; Sigma (Sigma Chemical Company, St. Louis, MO); Dynal (Dynal A.S., 
Oslo, Norway); Gull (Gull Laboratories, Salt Lake City, UT); Epicentre (Epicentre 

30 Technologies, Madison, WI); Lampire (Biological Labs., Inc., Coopersberg, PA); MJ 
Research (MJ Research, Watertown,MA); National Biosciences (National Biosciences, 
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Plymouth, MN); NEB (New England Biolabs, Beverly, MA); Novagen (Novagen, Inc., 
Madison, WI); ; Promega (Promega, Corp., Madison, WI); Stratagene (Stratagene 
Cloning Systems, La Jolia, CA); Clonetech (Clonetech, Palo Alto, CA) Pharmacia 
(Pharmacia, Piscataway, NJ); Milton Roy (Milton Roy, Rochester, NY); Amersham 

5 (Amersham International, Chicago, IL); and USB (U.S. Biochemical, Cleveland, OH). 
Glen Research (Glen Research, Sterling, VA); Coriell (Coriell Cell Repositories, 
Camden, NJ); Gentra (Centra, Minneapolis, MN); Third Wave Technologies (Third 
Wave Technologies, Madison, WI); PerSeptive Biosystems (PerSeptive Biosystems, 
Framington, MA); Microsoft (Microsoft, Redmond, WA); Qiagen (Qiagen, Valencia, 

10 CA); Molecular Probes (Molecular Probes, Eugene, OR); VWR (VWR Scientific, ); 
Advanced Biotechnologies (Advanced Biotechnologies, INC, Columbia, MD); and 
Perkin Elmer (also known as PE Biosytems and Applied Biosystems, Foster City, CA). 

EXAMPLE 1 

15 Rapid screening of colonies for 5' nuclease activity 

The native 5' nucleases and the enzymes of the present invention can be tested 
directly for a variety of fiinctions. These include, but are not limited to, 5' nuclease 
activity on RNA or DNA targets and background specificity using alternative substrates 

20 representing structures that may be present in a target detection reaction. Examples of 
nucleic acid molecules having suitable test structures are shown schematically in Figures 
18A-D and Figures 21-24, The screening techniques described below were developed to 
quickly and efficiently characterize 5' nucleases and to determine whether the new 5' 
nucleases have any improved or desired activities. Enzymes that show improved cycling 

25 rates on RNA or DNA targets, or that result ui reduced target-independent cleavage merit 
more thorough investigation. In general, the modified proteins developed by random 
mutagenesis were tested by rapid colony screen on the substrates shown in Figures 18A 
and 1 8B. A rapid protein extraction was then done, and a test of activity on alternative 
structures, {e.g,, as shown in Figures 18C-D) was performed using the protein extract, 

30 Either the initial screen, or further screening and characterization of enzymes for 
improved activity may be performed using other cleavage complexes, such as those 
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diagrammed in Figures 21-24. It is not intended that the scope of the invention be limited 
by the particular sequences used to form such test cleavage structures. One skilled in the 
art would understand how to design and create comparable nucleic acids to form 
analogous structures for rapid screening. 

5 This order of testing may be chosen to reduce the number of tests overall, to save 

time and reagents. The order of testing for enzyme function is not intended to be a 
limitation on the present invention. Those mutants that showed reasonable cycling rates 
with the RNA or DNA targets may then be cultured overnight, and a rapid protein 
extraction done. Alternatively, any subset or all of the cleavage tests may be done at the 

10 same time. 

For convenience, each type of rapid screen may be done on a separate microtiter 
plate. For example, one plate may be set up to test RNA INVADER activity, one plate 
set up to test for DNA INVADER activity. As many as 90 different colonies may be 
screened on one plate. The colonies screened can be from a variety of sources, such as 
15 clones of unaltered (native) 5' nucleases, from one mutagenesis reaction {e.g., many 
colonies from a single plate) or from a variety of reactions (colonies selected from 
multiple plates). 

Ideally, positive and negative controls should be run on the same plate as the 
mutants, using the same preparation of reagents. One example of a good positive confrol 

20 is a colony containing the unmodified enzyme, or a previously modified enzyme whose 
activity is to be compared to new mutants. For example, if a mutagenesis reaction is 
performed on the Taq DN RX HT construct (described below), the unmodified Taq DN 
RX HT construct would be chosen as the standard for comparing the effects of 
mutagenesis on enzymatic activity. Additional control enzymes may also be incorporated 

25 into the rapid screening test. For example, Tth DN RX HT (described below; unless 
otherwise specified, the TaqPol and TthPol enzymes of the following discussion refer to 
the DN RX HT derivative) may also be included as a standard for enzymatic activity 
along with the Taq DN RX HT. This would allow a comparison of any altered enzymes 
to two known enzymes having different activities. A negative conti:ol should also be run 

30 to determine the background reaction levels (i. e. , cleavage or probe degradation due to 

sources other than the nucleases being compared). A good negative control colony would 
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be one containing only the vector used in the cloning and mutagenesis, for example, 
colonies containing only the pTrc99A vector. 

Two factors that may influence the number of colonies chosen from a specific 
mutagenesis reaction for the initial rapid screen are 1) total number of colonies obtained 
from the mutagenesis reaction, and 2) whether the mutagenesis reaction was site-specific 
or randomly distributed across a whole gene or a region of a gene. For example, if only 
5-10 colonies are present on the plate, all colonies can easily be tested. If hundreds of 
colonies are present, a subset of these may be analyzed. Generally 10-20 colonies are 
tested from a site-specific mutagenesis reaction, while 80 to 100 or more colonies are 
routinely tested from a single random mutagenesis reaction. 

Where indicated, the altered 5' nucleases described in these experimental 
examples were tested as detailed below. 

A. Rapid screen: INVADER activity on RNA target (Figure 18 A) 

A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7,5, 10 mM 
MgS04, 200 mM KCl, 2 iiM FRET-probe oligo SEQ ID NO:223 
(5'-Fl-CGCT-cy3-TCTCGCTCGC-3'), 1 ^M INVADER oUgo SEQ ID NO:224 
(5'-ACGGAACGAGCGTCTTTG-3'), and 4 nM RNA target SEQ ID NO:225 (5'-GCG 
AGC GAGA CAG CGA AAG ACG CUC GUU CCG U-3'). Five ^il of the 2X substrate 
mix were dispensed into each sample well of a 96 well microtiter plate (Low Profile 
MULTIPLATE 96, MJ. Research, Inc.)^ 

Cell suspensions were prepared by picking single colonies (mutants, positive 
control, and negative control colonies) and suspending each one in 20|al of water. This 
can be done conveniently in a 96 well microtiter plate format, using one well per colony. 

Five [il of the cell suspension was added to the appropriate test well such that the 
final reaction conditions were 10 mM MOPS, pH 7,5, 5 mM MgS04, 100 mM KCl, 1 ^M 
FRET-probe oligo, 0,5 [iM INVADER oUgo, and 2 nM RNA target. The wells were 
covered with 10 \xl of Clear CHILLOUT 14 (MJ. Research, Inc.) Uquid wax, and the 
samples were heated at SS^'C for 3 minutes, then incubated at 59°C for 1 hour. After the 
incubation, the plates were read on a Cytofluor flourescense plate reader using the 
following parameters: excitation 485/20, emission 530/30. 
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B. Rapid screen: INVADER activity on DNA target (Figure 18B) 

A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 10 mM 
MgS04, 200 mM KCl, 2 ^iM FRET-probe oligo SEQ ID NO:223 

5 (5'-Fl-CGCT-Cy3-TCTCGCTCGC-3'), 1 ^iM INVADER oligo SEQ ID NO:224 
(5'-ACGGAACGAGCGTCTTTG-3'), 1 nM DNA target SEQ ID NO:226 (5'-GCG 
AGC GAGA CAG CGA AAG ACG CTC GTT CCG T-3'). Five \i\ of the 2X substrate 
mix was dispensed into each sample well of a 96 well microtiter plate (MJ Low Profile). 
Cell suspensions were prepared by picking single colonies (mutants, positive 

10 control and negative control colonies) and suspending them in 20 (j,l of water, generally 
in a 96 well microtiter plate format. 

5 III of the cell suspension were added to the appropriate test well such that the 
final reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^iM 
FRET-probe ohgo, 0.5 ^iM INVADER oligo, and 0.5 nM DNA target. Wells were 

15 covered with 10|a 1 of Clear CHILLOUT 14 (M.J. Research, Inc.) liquid wax, and the 
reactions were heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the 
hour incubation, the plate were read on a Cytofluor flourescan plate reader using the 
following parameters: excitation 485/20, emission 530/30, gain 40, reads per well 10. 

20 C. Rapid protein extraction (crude cell lysate) 

Those mutants that gave a positive or an unexpected result in either the RNA or 
DNA INVADER assay were further analyzed, specifically for background activity on the 
X-structure or the hairpin substrate (Figure ISC and D, respectively). A rapid colony 
screen format can be employed, as described above. By simply changing the substrate, 

25 tests for background or aberrant enzymatic activity can be done. Another approach 
would be to do a rapid protein extraction from a small overnight culture of positive 
clones, and then test this crude cell lysate for additional protein function. One possible 
rapid protein extiraction procedure is detailed below. Two to five ml of LB (containing 
the appropriate antibiotic for plasmid selection; See e.g., Maniatis, books 1,2 and 3) were 

30 inoculated with the remaining volume of the 20 \il water-cell suspension and incubated at 
37°C overnight. About 1.4 ml of the culture were ti^sferred to a 1.5 ml microcentiifuge 
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tube, and microcentrifuged at top speed {e.g., 14,000 rpm in an Eppendorf 5417 table top 
microcentrifuge), at room temperature for 3-5 minutes to pellet the cells. The supernatant 
was removed, and the cell pellet was suspended in 100 ^1 of TES buffer pH 7.5 (Sigma). 
Lysozyme (Promega) was added to a final concentration of 0.5)ag/}il and samples were 
5 incubated at room temperature for 30 minutes. Samples were then heated at 70°C for 10 
minutes to inactivate the lysozyme, and the cell debris was pelleted by 
microcentrifugation at top speed for 5 minutes. The supernatant was removed and this 
crude cell lysate was used in the following enzymatic activity assays. 

D. Rapid screen: background specificity X structure substrate (Figure 18C) 
Reactions were performed under conditions as detailed above. One jal of crude 

cell lysate was added to 9 of reaction components for a final volume of 10 p.1 and final 
concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^iM 
FRET-probe oligo (SEQ ID NO:223), O.S^iM X-structure INVADER oligo SEQ ID 
NO:227 (5'-ACGGAACGAGCGTCTTTCATCTGTCAATC-3'), and 0.5 nM DNA 
target (SEQ ID NO:226). Wells were covered with 10 ^il of Clear CHILLOUT 14 (M.J. 
Research, Inc.) liquid wax, and the reactions were heated at 85°C for 3 minutes, then 
incubated at 59°C for 1 hour. After the incubation, the plates were read on a Cytofluor 
fluorescence plate reader using the following parameters: excitation 485/20, emission 
530/30, gain 40, reads per well 10. 

E. Rapid screen: background specificity hairpin substrate (Figure 18D) 

Reactions were performed under conditions as detailed above. One \i\ of crude 
cell lysate was added to 9 yd of reaction components for a final volume of 10 |al and fmal 
25 concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 (xM 

FRET-probe oligonucleotide (SEQ ID NO:223), and 0.5 nM DNA target (SEQ ID 
NO:226). Wells were covered with 10 |al of Clear CHILLOUT 14 (M.J. Research, Inc.) 
liquid wax, and the reactions were heated at 85°C for 3 minutes, then incubated at 59°C 
for 1 hour. After the hour incubation, the plate were read on a Cytofluor plate reader 
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using the following parameters: excitation 485/20, emission 530/30, gain 40, reads per 
well 10. 

F. Activity assays with IrTl and IdT targets (Figures 24) 

5 The 5' nuclease activities assays were carried out in 1 0 |al of a reaction containing 

10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 \ig/m\ tRNA, 100 mM 
KCl and 5 mM MgS04. The probe concentration (SEQ ID NO: 167) was 2 mM. The 
substrates (IrTl (SEQ ID NO: 228) or IdT (SEQ ID NO: 229) at 10 or 1 nM final 
concentration respectively) and approximately 20 ng of an enzyme, prepared as in 

1 0 Example 3 , were mixed with the above reaction buffer and overlaid with CHILLOUT 
(MJ Research) Uquid wax. Reactions were brought up to reaction temperature 57 °C, 
started by addition of MgS04, and incubated for 10 min. Reactions were then stopped by 
the addition of 10 |xl of 95% formamide containing 10 mM EDTA and 0.02% methyl 
violet (Sigma). Samples were heated to 90*'C for 1 minute immediately before 

15 electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 
urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM EDTA. Unless otherwise 
indicated, 1 ^1 of each stopped reaction was loaded per lane. Gels were then scanned on 
an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 nm filter. The fraction of 
cleaved product was determined from intensities of bands corresponding to uncut and cut 

20 substrate with FMBIO Analysis software (version 6.0, Hitachi). The fraction of cleavage 
product did not exceed 20% to ensure that measurements approximated initial cleavage 
rates. The turnover rate was defined as the number of cleaved signal probes generated 
per target molecule per minute under these reaction conditions (1/min). 

25 G. Activity assays with X structure (X) and hairpin (HP) targets (Figures 22) 

The 5' nuclease activity assays were carried out in 10 )xl of a reaction containing 
10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ^g/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Each oligo for formation of either the hairpin structure assembly 
(22A, SEQ ID NOS: 230 and 231) assembly or the X structure assembly (22B, SEQ ID 
30 NOS: 230-232) was added to a final concentration of 1 |am, and approximately 20 ng of 
test enzyme prepared as described in Example 3, were mixed with the above reaction 
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buffer and overlaid with CHILLOUT (MJ Research) liquid wax. Reactions were brought 
up to reaction temperature 60 "C, started by addition of MgS04, and incubated for 10 
min. Reactions were then stopped by the addition of 10 )xl of 95% formamide containing 
10 mM EDTA and 0.02% methyl violet (Sigma). Samples were heated to 90°C for 1 

5 minute immediately before electrophoresis through a 20% denaturing acrylamide gel 
(19:1 cross-hnked), with 7 M urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM 
EDTA. Unless otherwise indicated, 1 |xl of each stopped reaction was loaded per lane. 
Gels were then scanned on an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 
nm fiher. The fraction of cleaved product was determined from intensities of bands 

10 corresponding to uncut and cut substrate with FMBIO Analysis software (version 6.0, 
Hitachi). The fraction of cleavage product did not exceed 20% to ensure that 
measurements approximated initial cleavage rates. The turnover rate was defined as the 
number of cleaved signal probes generated per target molecule per minute under these 
reaction conditions (1/min). 

15 

H. Activity assays with human IL-6 target (Figure 10) 

The 5' nuclease activities assays were carried out in 10 ^1 reactions containing 10 
mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 }ig/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Reactions comprising the DNA IL-6 substrate contained 0.05 

20 nM IL-6 DNA target (SEQ ID NO: 163) and 1 ixM of each probe (SEQ ID NO: 162) and 
INVADER (SEQ ID NO: 161) oligonucleotides, and were carried out at 60°C for 30 min. 
Reactions comprising the IL-6 RNA target (SEQ ID NO: 160) were performed under the 
same conditions, except that the IL-6 RNA target concentration was 1 nM and the 
reactions were performed at 57°C for 60 min. Each reaction contained approximately 20 

25 ng of test enzyme, prepared as described in Example 3. 

I. Activity assays with synthetic r25mer target (Figure 23) 

Reactions comprising the synthetic r25mer target (SEQ ID NO: 233) were carried 
out under the same reaction conditions (10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% 
30 Nonidet P-40, 10 ^g/ml tRNA, 100 mM KCl and 5 mM MgS04) and 1 ^iM of each probe 
(SEQ ID NO: 234) and INVADER (SEQ ID NO: 235) oUgonucleotides, except that the 
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r25mer target concentration was 5 nM and the reactions were performed at 58°C for 60 
min. Approximately 20 ng of each test enzyme was added to the reactions. Enzymes 
were prepared as described in Example 3. 

5 Any of the tests described above can be modified to derive the optimal conditions 

for enzymatic activity. For example, enzyme titrations can be done to determine the 
optimal enzyme concentration for maximum cleavage activity, and lowest background 
signal By way of example, but not by way of limitation, many of the mutant enzymes 
were tested at 10, 20 and 40 ng amounts. Similarly, a temperature titration can also be 
10 incorporated into the tests. Since modifying the structure of a protein can alter its 

temperature requirements, a range of temperatures can be tested to identify the condition 
best suited for the mutant in question. 

Examples of the results from such screens (using approximately 20 ng of the 
mutant enzyme) are shown in Tables 3-8, and Figures 12, 14, 15, 19, and 25. 

15 

EXAMPLE 2 

Cloning and Expression of 5* nucleases of DNA polymerases and mutant 

polymerases 

20 A, DNA polymerases of Thermus aquaticus and Thermus thermophilus 
L Cloning of TaqPol and TthPol 

Type A DNA polymerases from eubacteria of the genus Thermus share extensive 
protein sequence identity (90% in the polymerization domain, using the Lipman-Pearson 

25 method in the DNA analysis software from DNAStar, WI) and behave similarly in both 
polymerization and nuclease assays. Therefore, the genes for the DNA polymerase of 
Thermus aquaticus (TaqPol), Thermus thermophilus (TthPol) and Thermus scotoductus 
were used as representatives of this class. Polymerase genes from other eubacterial 
organisms, including, but not limited to, Escherichia coli, Streptococcus pneumoniae, 

30 Mycobacterium smegmatis, Thermus thermophilus, Thermus sp,, Thermotoga maritima, 
Thermosipho africanuSy and Bacillus stearothermophilus are equally suitable. 
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a. Initial TaqPol Isolation: mutant TaqA/G 

The Taq DNA polymerase gene was amplified by polymerase chain reaction from 
genomic DNA from Thermus aquaticus, strain YT-1 (Lawyer et al, supra), using as 

5 primers the oligonucleotides described in SEQ ID NOS :236 and 237. The resulting 
fragment of DNA has a recognition sequence for the restriction endonuclease EcoRi at 
the 5' end of the codmg sequence and a Bglil sequence at the 3' end of the coding strand. 
Cleavage with BgRl leaves a 5' overhang or "sticky end" that is compatible with the end 
generated by BamYil. The PCR-ampUfied DNA was digested with EcdRl and Bamm. 

10 The 25 12 bp fragment containing the coding region for the polymerase gene was gel 
purified and then ligated into a plasmid that contains an inducible promoter. 

In one embodiment of the invention, the pTTQlS vector, which contains the 
hybrid trp-lac (tac) promoter, was used (M.J.R. Stark, Gene 5:255 [1987]). The tac 
promoter is under the control of the E. coli lac repressor protein. Repression allows the 

1 5 synthesis of the gene product to be suppressed until the desired level of bacterial growth 
has been achieved, at which point repression is removed by addition of a specific inducer, 
isopropyl-b-D-thiogalactopyranoside (IPTG). Such a system allows the controlled 
expression of foreign proteins that may slow or prevent growth of transformants. 

Particularly strong bacterial promoters, such as the synthetic Vtac, may not be 

20 adequately suppressed when present on a multiple copy plasmid. If a highly toxic protein 
is placed under control of such a promoter, the small amount of expression leaking 
through, even in the absence of an inducer, can be harmful to the bacteria, hi another 
embodiment of the invention, another option for repressing synthesis of a cloned gene 
product is contemplated. A non-bacterial promoter from bacteriophage T7, found in the 

25 plasmid vector series pET-3, was used to express the cloned mutant Taq polymerase 
genes (Studier and Moffatt, J. Mol. Biol., 189:113 [1986]). This promoter initiates 
transcription only by T7 RNA polymerase. In a suitable strain, such as 
BL21(DE3)pLYS, the gene for the phage T7 RNA polymerase is carried on the bacterial 
genome under control of the lac operator. This arrangement has the advantage that 

30 expression of the multiple copy gene (on the plasmid) is completely dependent on the 
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expression of T7 RNA polymerase, which is easily suppressed because it is present in a 
single copy. 

These are just two examples of vectors having suitable inducible promoters. 
Others are well known to those skilled in the art, and it is not intended that the improved 
nucleases of the present invention be limited by the choice of expression system. 

For ligation into the pTTQlS vector, the PGR product DNA containing the Taq 
polymerase coding region (termed maiTaq for reasons discussed below, SEQ ID 
NO:238) was digested with EcoRI and Bglll and this fragment was ligated under 
standard "sticky end" conditions (Sambrook et al. Molecular Cloning, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, pp. 1.63-1.69 [1989]) into the EcoKL and 
Bamm sites of the plasmid vector pTTQlS. Expression of this construct yields a 
translational fusion product in which the first two residues of the native protein 
(Met-Arg) are replaced by three from the vector (Met-Asn-Ser), but the remainder of the 
PCR product's protein sequence is not changed (SEQ ID NO:239). The construct was 
transformed into the JM109 strain of coli, and the transformants were plated under 
incompletely repressing conditions that do not permit growth of bacteria expressing the 
native protein. These plating conditions allow the isolation of genes containing 
pre-existing mutations, such as those that result from the infideUty of Taq polymerase 
during the amplification process. 

Using this amplification/selection protocol, a clone was isolated containing a 
mutated Taq polymerase gene {mxATaq). The mutant was first detected by its phenotype, 
in which temperature-stable 5' nuclease activity in a crude cell extract was normal, but 
polymerization activity was ahnost absent (approximately less than 1% of wild type Taq 
polymerase activity). Polymerase activity was determined by primer extension reactions. 
The reactions were carried out in 10 \xX of buffer containing 10 mM MOPS, pH 7.5, 5 
mM MgS04, 100 mM KCl. In each reaction, 40 ng of en2yme were used to extend 10 
piM (dT)25-3o primer in the preesnce of either 10 ^iM poly (A)286 or 1 |aM poly (dA)273 
template, 45 \iM dTTP and 5 yM Fl-dUTP at 60°C for 30 minutes. Reactions were 
stopped with 10 \i\ of stop solution (95% formamide, 10 mM EDTA, 0.02% methyl violet 
dye). Samples (3 ^1) were fractionated on a 15% denaturing acrylamide gel (19: 1 
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crossed-linked) and the fraction of incorporated Fl-dUTP was quantitated using an 
FMBIO-100 fluorescence gel scanner (Hitachi) equipped with a 505 nm emission filter. 

DNA sequence analysis of the recombinant gene showed that it had changes in 
the polymerase domain resulting in two amino acid substitutions: an A to G change at 

5 nucleotide position 1 394, which causes a Glu to Gly change at amino acid position 465 
(numbered according to the natural nucleic and amino acid sequences, SEQ ID NOS:153 
and 1 57), and another A to G change at nucleotide position 2260, which causes a Gin to 
Arg change at amino acid position 754. Because the Ghi to Gly mutation is at a 
nonconserved position and because the Glu to Arg mutation alters an amino acid that is 

10 conserved in virtually all of the known Type A polymerases, the latter mutation is most 
Ukely the one responsible for curtaiUng the synthesis activity of this protein. The 
nucleotide sequence for the construct is given in SEQ ID NO:39. The enzyme encoded 
by this sequence is referred to as Taq A/G. 

15 b. Initial TthPol Isolation 

The DNA polymerase enzyme from the bacterial species Thermus thermophilus 
(Tth) was produced by cloning the gene for this protein into an expression vector and 
overproducing it in E. coli cells. Genomic DNA was prepared from 1 vial of dried 
Thermus thermophilus sfrain HB-8 from ATCC (ATCC #27634). The DNA polymerase 

20 gene was amphfied by PGR using the following primers: 

5'-CACGAATTCCGAGGCGATGCTTCCGCTC-3' (SEQ ID NO:240) and 
5'-TCGACGTCGACTAACCCTTGGCGGAAAGCC-3' (SEQ ID NO:241). The 
resulting PGR product was digested with EcoRl and Sail restriction endonucleases and 
inserted into EcoKUSal I digested plasmid vector pTrc99G (described in Example 2C1) 

25 to create the plasmid pTrcTth- 1 . This Tth polymerase construct is missing a single 
nucleotide that was inadvertently omitted from the 5' ohgonucleotide, resulting in the 
polymerase gene being out of frame. This mistake was corrected by site specific 
mutagenesis of pTrcTth-1 as described in Examples 4 and 5 using the following 
ohgonucleotide: 5'-GCATCGCCTCGGAATTCATGGTC-3' (SEQ ID NO:242), to create 

30 the plasmid pTrcTth-2. The protein and the nucleic acid sequence encoding the protein 
are referred to as TthPol, and are Usted as SEQ ID NOS:243 and 244 respectively. 
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c. Large Scale preparation of recombinant proteins 

The recombinant proteins were purified by the following technique which is 
derived jfrom a Taq DNA polymerase preparation protocol (Engelke et al. Anal. 
5 Biochem., 191 :396 [1990]) as follows. E. coli cells (strain JM109) containing either 
pTrc99A TaqPol, pTrc99GTthPol were inoculated into 3 ml of LB containing 100 mg/ml 
ampicillin and grown for 16 hrs at 37°C. The entire overnight culture was inoculated into 
200 ml or 350 ml of LB containing 100 mg/ml ampicillin and grown at 37°C with 
vigorous shaking to an Aeoo of 0.8. IPTG (1 M stock solution) was added to a final 
10 concentration of 1 mM and growth was continued for 16 hrs at 37°C. 

The induced cells were pelleted and the cell pellet was weighed. An equal 
J volume of 2X DG buffer (1 00 mM Tris-HCl, pH 7.6, 0. 1 mM EDTA) was added and the 

pellet was suspended by agitation. Fifty mg/ml lysozyme (Sigma) were added to 1 
f ^ mg/ml final concentration and the cells incubated at room temperature for 1 5 min. 

Ui 15 Deoxycholic acid (10% solution) was added dropwise to a final concentration of 0.2 % 
^ ' while vortexing. One volume of H2O and 1 volume of 2X DG buffer were added, and the 

0 resulting mixture was sonicated for 2 minutes on ice to reduce the viscosity of the 

m mixture. After sonication, 3 M (NH4)2S04 was added to a final concentration of 0.2 M, 

X and the lysate was centrifixged at 14000 x g for 20 min at 4°C. The supernatant was 

¥ 20 removed and incubated at 70°C for 60 min at which time 10% polyethylimine (PEI) was 
added to 0.25%, After incubation on ice for 30 min., the mixture was centrifuged at 
14,000 X g for 20 min at 4^C. At this point, the supematant was removed and the protein 
precipitated by the addition of (]S[H4)2S04 as follows. 

Two volumes of 3 M (NH4)2S04 were added to precipitate the protein. The 
25 mixture was incubated ovemight at room temperature for 16 hrs centrifuged at 14,000 x g 
for 20 min at 4'*C. The protein pellet was suspended in 0.5 ml of Q buffer (50 mM 
Tris-HCl, pH 8.0, 0.1 mM EDTA, 0.1% Tween 20). For the Mja FEN-1 preparation, 
solid (NH4)2S04 was added to a final concentration of 3 M (--75% saturated), the mixture 
was incubated on ice for 30 min, and the protein was spun down and suspended as 
30 described above. 
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The suspended protein preparations were quantitated by determination of the A279 
dialyzed and stored in 50% glycerol, 20 mM Tris HCl, pH8.0, 50 mM KCl, 0.5% Tween 
20, 0.5% Nonidet P-40, with 100 pg/ml BSA. 

B. DNA polymerases of Thermus filiformis and Thermus scotoductus 
1. Cloning of Thermus filiformis and Thermus scotoductus 

One vial of lyophilized Thermus filiformis (Tfi) obtained from DSMZ (Deutsche 
Sammlung von Mikroorganismen und Zellculturen, Braunschweig, Germany, strain 
#4687) was rehydrated in 1 ml of Castenholz medium (DSMZ medium 86) and 
inoculated into 500 ml of Castenholz medium preheated to 50°C. The culture was 
incubated at 70°C with vigorous shaking for 48 hours. After growth, the cells were 
harvested by centrifugation at 8000 x g for 10 minutes, the cell pellet was suspended in 
10 ml of TE (10 mM TrisHCL, pH 8.0, 1 mM EDTA), and the cells were frozen at -20°C 
in 1 ml aliquots. A 1 ml aliquot was thawed, lysozyme was added to 1 mg/ml, and the 
cells were incubated at 23°C for 30 minutes. A solution of 20% SDS (sodium dodecyl 
sulfate) was added to a final concentration of 0.5% followed by extraction with buffered 
phenol. The aqueous phase was fiuther extracted with 1:1 phenolxhloroform, and 
extracted a final time with chloroform. One-tenth volume of 3 M sodium acetate, pH 5.0 
and 2.5 volumes of ethanol were added to the aqueous phase and mixed. The DNA was 
pelleted by centrifiigation at 20,000 x g for 5 minutes. The DNA pellet was washed with 
70% ethanol, air dried and resuspended in 200 |al of TE and used directly for 
ampUfication. Thermus scotoductus (Tsc, ATCC # 51532) was grown and genomic DNA 
was prepared as described above for Thermus filiformis. 

The DNA polymerase I gene from Tfi (GenBank accession #AF030320) could not 
be amplified as a single fragment. Therefore, it was cloned in 2 separate fragments into 
the expression vector pTrc99a The 2 fragments overlap and share a Not I site which was 
created by introducing a silent mutation at position 1308 of the Tfi DNA polymerase open 
reading frame (ORE) in the PCR oligonucleotides. The 3' half of the gene was ampUfied 
using the Advantage cDNA PCR kit (Clonetech) with the following oUgonucleotides; 
5'-ATAGCCATGGTGGAGCGGCCGCTCTCCCGG (SEQ ID NO:245) and 
5'-AAGCGTCGACTCAATCCTGCTTCGCCTCCAGCC (SEQ ID NO:246). The PCR 
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product from this reaction was approximately 1200 base pairs in length. It was cut with 
the restriction enzymes Not I and Sal I, and the resulting DNA was ligated into pTrc99a 
cut with NotI and SaU to create pTrc99a-Tfi3'. The 5' half of the gene was ampUfied as 
described above using the following two primers; 

5'AATCGAATTCACCCCACTTTTTGACCTGGAGG (SEQ ID NO:247) and 
5'-CCGGGAGAGCGGCCGCTCCAC (SEQ ID NO:248). The resulting 1300 base pair 
fragment was cut with restriction enaymes Eco RI and Not I and ligated into 
pTrc99a-Tfi3' cut with Notl and EcoKL to produce pTrc99a-TfiPol, SEQ ID NO:249 (the 
corresponding amino acid sequence is listed in SEQ ID NO:250). 

The DNA polymerase I gene from Thermus scotoductus was amplified using the 
Advantage cDNA PGR kit (Clonetech) using the following two primers; 
5'-ACTGGAATTCCTGCCCCTCTTTGAGCCCAAG (SEQ ID NO:251) and 
5'-AACAGTCGACCTAGGCCTTGGCGGAAAGCC (SEQ ID NO:252). The PGR 
product was cut with restriction enzymes Eco RI and Sal I and ligated into Eco RI, Sal I 
cut pTrc99a to create pTrc99a-TscPol SEQ ID NO:253 (the corresponding amino acid 
sequence is listed in SEQ ID NO:254). 

2. Expression and purification of Thermus filiformis and Thermus 
scotoductus 

Plasmids were transformed into protease deficient E. coli strain BL21 (Novagen) 
or sfrain JM109 (Promega Corp., Madison, WI) for protein expression. Flasks containing 
200 ml of LB containing 100 ng/ml ampicillin were inoculated with either a single 
colony from an LB plate or from a frozen stock of the appropriate sfrain. After several 
hours of growth at 37°C with vigorous shaking, cultures was induced by the addition of 
200 jil of 1 M isothiopropyl-galatoside (IPTG). Growth at 37°C was continued for 16 
hours prior to harvest. Cells were pelleted by centrifugation at 8000 x g for 15 minutes 
followed by suspension of the cell pellet in 5 ml of TEN (10 mM TrisHCl, pH.8.0, 1 mM 
EDTA, 100 mM NaCl). 100 \i\ of 50 mg/ml lysozyme were added and the cells 
incubated at room temperature for 15 minutes. Deoxycholic acid (10%) was added to a 
final concenfration of 0.2%. After thorough mixing, the cell lysates were sonicated for 2 
minutes on ice to reduce the viscosity of the mixture. Cellular debris was pelleted by 
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centrifugation at 4°C for 15 minutes at 20,000 x g. The supernatant was removed and 
incubated at 70°C for 30 min after which 10% polyethylimine (PEI) was added to 0.25%. 
After incubation on ice for 30 minutes, the mixture was centrifiiged at 20,000 x g for 20 
min at 4°C. At this point, the supernatant containing the enzyme was removed, and the 

5 protein was precipitated by the addition of 1 .2 g of ammonium sulfate and incubation at 
4°C for 1 hour. The protein was pelleted by centrifiigation at 4°C for 10 minutes at 
20,000 X g. The pellet was resuspended in 4 ml of HPLC Buffer A (50 mM TrisHCl, pH 
8.0, 1 mM EDTA). The protein was fiirther purified by affinity chromatography using an 
Econo-Pac heparin cartridge (Bio-Rad) and a Dionex DX 500 HPLC instrument. Briefly, 

10 the cartridge was equilibrated with HPLC Buffer A, and the enzyme extract was loaded 
on the column and eluted with a linear gradient of NaCl (0-2 M) in the same buffer. Pure 
protein elutes between 0.5 and 1 M NaCl. The enzyme peak was collected and dialyzed 
in 50% glycerol, 20 mM Tris HCl, pH 8, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
P40, lOOmg/ml BSA. 

15 

C. Generation of polymerase mutants with reduced polymerase activity but 
unaltered 5' nuclease activity 

All mutants generated in section C were expressed and purified as described in 
Example 2A1C. 

20 

1. Modified TaqPol Genes: TaqDN 

A polymerization deficient mutant of Tag DNA polymerase called TaqDN was 
constructed. TaqDN nuclease contains an asparagine residue in place of the wild-type 
aspartic acid residue at position 785 (D785N). 

25 DNA encoding the TaqDN nuclease was constructed from the gene encoding the 

Taq A/G in two rounds of site-directed mutagenesis. First, the G at position 1397 and the 
G at position 2264 of the Taq A/G gene (SEQ ID NO:238) were changed to A at each 
position to recreate a wild-type TaqPol gene. In a second round of mutagenesis, the wild 
type TaqPol gene was converted to the Taq DN gene by changing the G at position 2356 

30 to A. These manipulations were performed as follows. 
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DNA encoding the Taq A/G nuclease was recloned from pTTQlS plasmid into 
the pTrc99A plasmid (Pharmacia) in a two-step procedure. First, the pTrc99A vector 
was modified by removing the G at position 270 of the pTrc99A map, creating the 
pTrc99G cloning vector. To this end, pTrc99A plasmid DNA was cut with Ncol and the 
recessive 3' ends were fiUed-in using the Klenow fragment of E.coli polymerase I in the 
presence of all four dNTPs at 37°C for 15 min. After inactivation of the Klenow 
fragment by incubation at 65°C for 10 min, the plasmid DNA was cut with EcoRI and the 
ends were again fiUed-in using the Klenow fragment in the presence of all four dNTPs at 
37°C for 15 min. The Klenow fragment was then inactivated by incubation at 65°C for 
10 min. The plasmid DNA was ethanol precipitated, recircularized by ligation, and used 
to transform E.coli JM109 cells (Promega). Plasmid DNA was isolated from single 
colonies, and deletion of the G at position 270 of the pTrc99A map was confirmed by 
DNA sequencing. 

In a second step, DNA encoding the Taq A/G nuclease was removed from the 
pTTQl 8 plasmid using EcoRl and SaK and the DNA fragment carrying the Taq A/G 
nuclease gene was separated on a 1% agarose gel and isolated with Geneclean II Kit (Bio 
101, Vista, CA). The purified fragment was hgated into the pTrc99G vector that had 
been cut with EcoRL and Sail. The Ugation mixture was used to fransform competent 
E.coli JM109 cells (Promega). Plasmid DNA was isolated from single colonies and 
insertion of the Taq A/G nuclease gene was confirmed by restriction analysis using 
EcoBl and Sail. 

Plasmid DNA pTrcAG carrying the Taq A/G nuclease gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 
pTrcAG plasmid DNA was mutagenized using two mutagenic primers, E465 (SEQ ID 
NO:255) (hitegrated DNA Technologies, Iowa) and R754Q (SEQ ID NO:256) 
(Integrated DNA Technologies), and the selection primer Trans Oligonucleotide 
AlwNI/Spel (Clontech, Palo Alto, CA, catalog #6488-1) according to TRANSFORMER 
Site-Directed Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to produce a restored 
wild-type TaqPol gene (pTrcWT). 
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pTrcWT plasmid DNA carrying the wild-type TaqPol gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 
pTrcWT was then mutagenized using the mutagenic primer D785N (SEQ ID NO:257) 

5 (Integrated DNA Technologies) and the selection primer Switch Oligonucleotide 

SpeFAlwNI (Clontech, Palo Alto, CA, catalog #6373-1) according to "mANSFORMER 
Site-Directed Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to create a plasmid 
containing DNA encodmg the Taq DN nuclease. The DNA sequence encoding the Taq 
DN nuclease is provided in SEQ ED NO:258; the amino acid sequence of Taq DN 

10 nuclease is provided in SEQ ID NO:259. 

2. Modified TthPol Gene: Tth DN 

The Tth DN construct was created by mutating the TthPol described above. The 
sequence encoding an aspartic acid at position 787 was changed by site-specific 
15 mutagenesis as described above to a sequence encoding asparagine. Mutagenesis of 
pTrcTth-2 with the following oligonucleotide: 

5'-CAGGAGGAGCTCGTTGTGGACCTGGA-3' (SEQ ID NO:260) was performed to 
create the plasmid pTrcTthDN. The mutant protein and protein coding nucleic acid 
sequence is termed TthDN SEQ ID NOS:261 and 262 respectively. 

20 

3. Taq DN HI and Tth DN HT 

Six amino acid histidine tags (his-tags) were added onto the carboxy termini of 
Taq DN and Tth DN. The site-directed mutagenesis was performed using the 
TRANSFORMER Site Directed Mutagenesis Kit (Clontech) according to the 
25 manufacturer's instructions. The mutagenic oligonucleotides used on the plasmids pTaq 
DN and pTth DN were sequence 1 17-067-03, 

5'-TCTAGAGGATCTATCAGTGGTGGTGGTGGTGGTGCTCCTTGGCGGAGAGC- 
3'(SEQIDNO:263)and 

5'-TGCCTGCAGGTCGACGCTAGCTAGTGGTGGTGGTGGTGGTGACCCTTGGCG 
30 GAAAGCC-3' (SEQ ID NO:264), sequence 136-037-05. The selection primer Trans 
Oligo AlwNI/Spel (Clontech, catalog # 6488-1) was used for both mutagenesis reactions. 
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The resulting mutant genes were termed Taq DN HT (SEQ ID NO:265, nucleic acid 
sequence; SEQ ID NO:266, amino acid sequence) and Tth DN HT (SEQ ID NO:267, 
nucleic acid sequence; SEQ ID NO:268, amino acid sequence). 

5 4. Purification of Taq DN HT and Tth DN HT 

Both Taq DN HT and Tth DN HT proteins were expressed in E. coli strain JM109 
as described in Example 2B2. After ammonium sulfate precipitation and centrifugation, 
the protein pellet was suspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 0.1 mM 
EDTAm 0.1% Tween 20). The protein was further purified by affinity chromatography 
10 using His-Bind Resin and Buffer Kit (Novagen) according to the manufacturer's 
instructions. 1 ml of His-Bind resin was transferred into a column, washed with 3 
C column volumes of sterile water, charged with 5 volumes of 1 X Charge Buffer, and 

i equilibrated with 3 volumes of IX Binding Buffer. Four ml of IX Binding Buffer was 

"t added to the protein sample and the sample solution was loaded onto the column. After 

15 washing with 3 ml of IX Binding Buffer and 3 ml of IX Wash Buffer, the bound His-Tag 
protein was eluted with 1 ml of IX Elute Buffer. The pure enzyme was then dialyzed in 
50% glycerol, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
P40, and 100 ^g.ml BSA. Enzyme concentrations were determined by measuring 
absorption at 279 mn. 

20 

EXAMPLE 3 

RNA-dependent 5' nuclease activity of TtliPoI can be conferred on TaqPol by 
transfer of tlie N-terminal portion of the DNA polymerase domain 

25 

A. Preparation and purification of substrate structures having either a DNA or 
an RNA target strand 

The downstream (SEQ ID NO:162) and upstream probes (SEQ ID N0:161) and 
the IL-6 DNA (SEQ ID NO: 163) (Figure 10) target strand were synthesized on a 
30 PerSeptive Biosystems instrument using standard phosphoramidite chemistry (Glen 

Research). The synthetic RNA-DNA chimeric IrT target labeled with biotin at the 5 '-end 
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(Figure 20A) was synthesized utilizing 2'-ACE RNA chemistry (Dharmacon Research). 
The 2'-protecting groups were removed by acid-catalyzed hydrolysis according to the 
manufacturer's instructions. The downstream probes labeled with S'-fluorescein (Fl) or 
5'-tetrachloro-fluorescein (TET) at their 5' ends were purified by reverse phase HPLC 
using a Resource Q column (Amersham-Pharmacia Biotech). The 648-nucleotide IL-6 
RNA target (SEQ ID NO: 160) (Figure 10) was synthesized by T7 RNA polymerase 
runoff-transcription of the cloned fragment of human IL-6 cDNA (nucleotides 64-691 of 
the sequence pubUshed in May et al., Proc. Natl. Acad. Sci., 83:8957 [1986]) using a 
Megascript Kit (Ambion). All oUgonucleotides were finally purified by separation on a 
20% denaturing polyacrylamide gel followed by excision and elution of the major band. 
OUgonucleotide concentration was determined by measuring absorption at 260 nm. The 
biotin labeled IrT target was incubated with a 5-fold excess of streptavidin (Promega) in a 
buffer containing 10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% NP-40 and 10 ^ig/ml 
tRNA at room temperature for 10 min. 

B. Introduction of restriction sites to malce chimeras 

The restriction sites used for formation of chimerical proteins, described below, 
were chosen for convenience. The restriction sites in the following example have been 
strategically placed to surround regions shown by crystal structure and other analysis to 
be functional domains (See, Figures 6, 7, and 19). Different sites, either naturally 
occurring or created via directed mutagenesis can be used to make similar constructs with 
other Type A polymerase genes from related organisms. It is desirable that the mutations 
all be silent with respect to protein fimction. By studying the nucleic acid sequence and 
the amino acid sequence of the protein, one can introduce changes in the nucleic acid 
sequence that have no effect on the corresponding amino acid sequence. If the nucleic 
acid change required affects an amino acid, one can make the alteration such that the new 
amino acid has the same or similar characteristics of the one replaced. If neither of these 
options is possible, one can test the mutant enzyme for function to determine if the 
nucleic acid alteration caused a change in protein activity, specificity or function. It is 
not intended that the invention be limited by the particular restriction sites selected or 
introduced for the creation of the improved enzymes of the present invention. 



193 



C Generation of Tth DN RX HT and Taq DN RX HT 

Mutagenesis was performed to introduce 3 additional, unique restriction sites into 
the polymerase domain of both the Taq DN HT and Tth DN HT en2ymes. Site-specific 

5 mutagenesis was performed using the Transformer Site-Directed Mutagenesis Kit firom 
(Clonetech) according to manufacturer's instructions. One of two different selection 
primers, Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, Palo Alto CA 
catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. 
The selection oUgo used in a given reaction is dependent on the selection restriction site 

10 present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were expressed in E.coli strain JM109. 

The Not I sites (amino acid position 328) were created using the mutagenic 
primers 5'-gccgccaggggcggccgcgtccaccgggcc (SEQ ID NO:269) and 
5'-gcctgcaggggcggccgcgtgcaccggggca (SEQ ID NO:270) corresponding to the sense 

15 strands of the Taq DN HT and the Tth DN HT genes, respectively. The BstI (amino acid 
position 382) and Ndel (amino acid position 443) sites were introduced into both genes 
using sense strand mutagenic primes 5'-ctcctggacccttcgaacaccacccc (SEQ ID NO:271) 
and 5'-gtcctggcccatatggaggccac (SEQ ID NO:272). The mutant plasmids were 
over-expressed and purified using Qiagen QiaPrep Spin Mini Prep Kit (cat. # 27106). 

20 The vectors were tested for the presence of the restriction sites by DNA sequencing and 
restriction mapping. These constructs are termed Tth DN RX HT (DNA sequence SEQ 
ID NO:273; amino acid sequence SEQ ID NO:274) and Taq DN RX HT (DNA sequence 
SEQ ID NO:275; amino acid sequence SEQ ID NO:276). 



25 D. Chimeras 

The chimeric constructs shown in Figure 19 were created by exchanging 
homologous DNA fi-agments defined by the restriction endonuclease sites EcoRI (E) and 
BamHI (B), common for both genes, the cloning vector site Sail (S) and the new sites, 
NotI (N), BstBI (Bs), Ndel (D) created at the homologous positions of both genes by site 
30 directed mutagenesis. In generating these chimeric enzymes, two different pieces of 
DNA are ligated together to yield the final construct. The larger piece of DNA that 
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contains the plasmid vector as well as part of the Taq or Tth (or parts of both) sequence 
will be termed the "vector." The smaller piece of DNA that contains sequences of either 
the Taq or Tth (or parts of both) polymerase will be termed the 'Insert " 

All restriction enzymes were from New England Biolabs or Promega and used in 
5 reactions with the accompanying buffer, according to the manufacturer's instructions. 
Reactions were done in 20 \x\ volume with about 500 ng of DNA per reaction, at the 
optimal temperature for the specified enzyme. More than one enzyme was used in a 
single reaction (double digest) if the en2ymes were compatible with respect to reaction 
buffer conditions and reaction temperature. If the enzymes in question were not 

10 compatible with respect to buffer conditions, the enzyme requiring the lowest salt 

condition was used first. After the completion of that reaction, buffer conditions were 
changed to be optimal or better suited to the second enzyme, and the second reaction was 
performed. These are common restriction enzyme digest strategies, well known to those 
in the art of basic molecular biology (Maniatis, supra), 

15 The digested restriction fi:agments were gel isolated for optimal ligation 

efficiency. Two |Lil of lOX loading dye (50% glycerol, IX TAB, 0.5% bromophenol 
blue) were added to the 20 \il reaction. The entire volume was loaded and run on a P/o, 
IX TAE agarose gel containing 1 |al of a 1% ethidium bromide solution per 100 ml of 
agarose gel solution. The digested fragments were visuaUzed imder UV light, and the 

20 appropriate fragments (as determined by size) were excised from the gel. These 

fragments were then purified using the Qiagen Gel Extractio Kit, (cat # 28706) according 
to the manufacturer's instructions. 

Ligations were performed in a 10 ^1 volume, using 400 units per reaction of T4 
DNA Ligase enzyme from New England Biolabs (catalog #202L), with the 

25 accompanying reaction buffer. Ligation reactions were done at room temperature for 1 
hour, with 1 |al of each of the Qiagen-purified fragments (approximately 20-50 ng of each 
DNA, depending on recovery from the gel isolation). Ligation products were then 
transformed into E. coli strain JM 109 and plated onto an appropriate growth and 
selection medium, such as LB with lOOp-g/ml of ampicillin to select for transformants. 

30 For each ligation reaction, six transformants were tested to determine if the 

desired construct was present. Plasmid DNA was purified and isolated using the QiaPrep 
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Spin Mini Prep Kit, according to manufacturer's instructions. The constructs were 
verified by DNA sequencing and by restriction mapping. 

Expression and purification of the chimeric enzymes was done as follows. 
Plasmids were transformed into E. coli strain JM109 (Promega). Log phase cultures (200 

5 ml) of JM109 were induced with 0.5 mM IPTG (Promega) and grown for an additional 
16 hours prior to harvest. Crude extracts containing soluble proteins were prepared by 
lysis of pelleted cells in 5 ml of 10 mM Tris-HCl, pH 8.3, ImM EDTA, 0.5mg/ml 
lysozyme during incubation at room temperature for 1 5 minutes. The lysate was mixed 
with 5 ml of 10 mM Tris-HCl pH 7.8, 50 mM KCl, 1 mM EDTA, 0.5% Tween 20, 0.5% 

10 Nonidet P-40, heated at 72°C for 30 minutes, and cell debris was removed by 

centrifugation at 12,000x g for 5 minutes. Final purification of the protein was done by 
affinity chromatograpy using an Econo-Pac heparin cartridge (Bio-Rad) and Dionex DX 
500 HPLC instrument. Briefly, the cartridge was equiUbrated with 50mM Tris-HCl pH 
8, 1 mM EDTA, and an enzyme extract dialyzed against the same buffer was loaded on 

15 the column and eluted with a hnear gradient of NaCl (0-2 M) in the same buffer. The 
HPLC-purified protein was dialyzed and stored in 50% (vol/vol) glycerol, 20 mM 
Tris-HCl pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet P-40, and lOO^g/m BSA. 
The enzymes were purified to homogeneity according to SDS-PAGE, and the enzyme 
concentrations were determined by measuring absorption at 279 mn. 

20 

1. Construction of TaqTth(N) and TthTaq(N) 

The first exchange that was performed involved the polymerase domains of the 
two enzymes. Separation of the nuclease domain (the N-terminal end of the protein) 
firom the polymerase domain (the C-termmal portion of the protein) was accomplished by 

25 cutting both genes with the restriction endonucleases EcoRI and Notl. The 

approximately 900 base pair firagment fi-om the Tth DN RX HT gene was cloned into the 
homologous sites of the Taq DN RX HT gene, and the approximately 900 base pair 
jfragment firom the Taq DN RX HT gene was cloned into the homologous sites of the Tth 
DN RX HT gene, yielding two chimeras, TaqTth(N) (DNA sequence SEQ ID NO:69; 

30 amino acid sequence SEQ ID N0:2) which has the Taq DN RX HT 5' nuclease domain 
and the Tth DN RX HT polymerase domain, and TthTaq(N) (DNA sequence SEQ ID 
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NO:70; amino acid sequence SEQ ID N0:3) which is made up of the Tth DN RX HT 5' 
nuclease domain and the Taq DN RX HT polymerase domain. 



2. Construction of TaqTth(N-B) 

5 The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 

the larger, vector fragment was gel isolated as detailed above. The Tth DN RX HT 
construct was also cut with Ndel and BamHI and the smaller (approximately 795 base 
pairs) Tth fragment was gel isolated and purified. The Tth Ndel-BamHI insert was 
ligated into the Taq Ndel-BamHI vector as detailed above to generate the TaqTth(N-B) 

10 (DNA sequence SEQ ID N0:71 ; amino acid sequence SEQ ID N0:4). 

3. Construction of TaqTth(B-S) 

The Taq DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fragment was gel isolated as detailed above. The Tth DN RX HT construct 
15 was also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Tth 
fragment was gel isolated and purified. The Tth BamHI-Sall insert was ligated into the 
Taq BamHI-Sall vector as detailed above to generate the TaqTth(B-S) (DNA sequence 
SEQ ID NO:72; amino acid sequence SEQ ID N0:5). 

20 4. Construction of TaqTth(N-D) 

The Taq DN RX HT construct was cut with the enzymes Not! and Ndel and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was 
also cut with NotI and Ndel and the smaller (approximately 345 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-Ndel insert was ligated into the Taq 
25 Notl-Ndel vector as detailed above to generate the TaqTth(N-D) (DNA sequence SEQ ID 
NO;73; amino acid sequence SEQ ID N0:6). 

5, Construction of TaqTth(D-B) 

The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 
30 the larger vector fragment was isolated as detailed above. The Tth DN RX HT construct 
was also cut with Ndel and BamHI and the smaller (approximately 450 base pairs) Tth 
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fragment was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the 
Taq Ndel-BamHI vector as detailed above to generate the TaqTth(D-B) (DNA sequence 
SEQ ID NO:74; amino acid sequence SEQ ID N0:7). 

5 6, Construction of TaqTth(Bs-B) 

The Taq DN RX HT construct was cut with the enzymes BstBI and BamHI and 
the larger vector fragment was isolated as detailed above. The Tth DN RX HT construct 
was also cut with BstBI and BamHI and the smaller (approximately 633 base pairs) Tth 
fragment was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the 
10 Taq Ndel-BamHI vector as detailed above to generate TaqTth(Bs-B) (DNA sequence 
SEQ ID NO:75; amino acid sequence SEQ ID N0:8). 

7. Construction of TaqTth(N-Bs) 

The Taq DN RX HT construct was cut with the enzjmies NotI and BstBI and the 
15 larger vector fi*agment was isolated as detailed above. The Tth DN RX HT construct was 
also cut with NotI and BstBI and the smaller (approximately 162 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-BstBI insert was ligated into the Taq 
Notl-BstBI vector as detailed above to generate TaqTth(N-Bs) (DNA sequence SEQ ID 
NO:76; amino acid sequence SEQ ID NO:9). 

20 

8. Construction of TthTaq(B-S) 

The Tth DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fi*agment was isolated as detailed above. The Taq DN RX HT construct was 
also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Tth 
25 fi-agment was gel isolated and purified. The Taq BamHI-Sall insert was ligated into the 
Tth BamHI-Sall vector as detailed above to generate the TthTaq(B-S) (DNA sequence 
SEQ ID NO:77; amino acid sequence SEQ ID NO: 1 0). 

9. Construction of Tth Taq(N-B) 

30 The Tth DN RX HT construct was cut with the enzymes NotI and BamHI and the 

larger vector fragment was isolated as detailed above. The Taq DN RX HT construct was 
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also cut with NotI and BamHI and the smaller (approximately 795 base pairs) Tth 
fragment was gel isolated and purified. The Taq Notl-BamHI insert was ligated into the 
Tth Notl-BamHI vector as detailed above to generate the TthTaq(N-B) (DNA sequence 
SEQ ID NO:78; amino acid sequence SEQ ID NO: 11), 

5 

The cleavage activities of these chimerical proteins were characterized as describe 
in Example 1, part A, and a comparison of the cleavage cycling rates on an RNA target is 
shown in Figure 12. As further discussed in the Description of the Invention, these data 
show that elements found in the central third of the TthPol protein are important in 
10 conferring the TthPol-like RNA-dependent cleavage activity on the chimerical proteins 
comprising portions of TaqPol. 

EXAMPLE 4 

Alterations influencing RNA-dependent 5' nuclease activity do not necessarily 
15 influence RNA-dependent DNA polymerase activity 

TthPol is known to have a more active RNA template dependent DNA 
polymerase than does the TaqPol (Myers and Gelfand, Biochemistry 30:7661 [1991]). 
To determine whether the RNA template dependent 5* nuclease activity of the Thermus 

20 DNA Pol I enzymes is related to their RNA-dependent polymerase activity, the D785N 
and D787N mutations used to create the polymerase-deficient versions of TaqPol and 
TthPol, respectively were reversed. Polymerase activity was similarly restored to the 
TaqTth (N) (DNA sequence SEQ ID NO:79; amino acid sequence SEQ ID NO:12), 
TaqTth(N-B) (DNA sequence SEQ ID NO:80; amino acid sequence SEQ ID NO: 13), 

25 TaqTth(B-S) (DNA sequence SEQ ID N0:81; amino acid sequence SEQ ID N0:14) 
chimeras, and the TaqPol(W417L/G418K/ E507Q) (DNA sequence SEQ ID NO:82; 
amino acid sequence SEQ ID NO: 15) mutant proteins. 

Polymerase function was restored in all the above mentioned en2yme mutants by 
inserting the BamHI to Sail fragment of the native, non-DN sequence into the selected 

30 chimera or mutant enzyme. For example, the mutant construct TaqTth(N-B) was cut 
with the restriction enzyme BamHI (approximate amino acid position 593) and the 
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restriction enzyme Sail (approximate amino acid position 840). The larger vector 
fragment was gel purified as described in Example 3D. The native TaqPol construct was 
also cut with the restriction endonucleases BamHI and Sail, and the smaller insert 
fragment containing the native amino acid sequence was also gel purified. The insert 
5 fragment was then ligated into the vector as detailed in Experimental Example 3D. 

The polymerase activities of these proteins were evaluated by extension of the 
dT25-35-oligonucleotide primer with fluorescein-labeled dUTP in the presence of either 
poly(dA) or poly(A) template. Primer extension reactions were carried out in 10 )al 
buffer containing 10 mM MOPS, pH7.5, 5 mM MgS04, 100 mM KCl. Forty ng of 

10 enzyme were used to extend 10 \iM (dT)25-30 primer in the presence of either 10 |iM 

poly(A)286 or 1 poly(dA)273 template, 45 luM dTTP and 5 ^iM Fl-dUTP at 60^C for 30 
min. Reactions were stopped with 10 |il of stop solution (95% formamide, 10 mM 
EDTA, 0,02% methyl violet dye). Samples (3 jal) were fractionated on a 15% denaturing 
acrylamide gel and the fraction of incorporated Fl-dUTP was quantitated using an 

15 FMBIO-100 fluorescent gel scanner (Hitachi) equipped with a 505 nm filter as described 
above. 

As shown in Figure 16, the DNA-dependent polymerase activities are very similar 
for all constructs used in this experiment, whereas the RNA-dependent polymerase 
activities of TthPol, TaqTth(N) and TaqTth(B-S) are at least 6-fold higher than the 
20 activities of TaqPol, TaqTth(N-B) and the TaqPol W417L/G418K/E507Q mutant. From 
the analysis of these results, it can be concluded that the high RNA-dependent DNA 
polymerase activity of TthPol is determined by the C-terminal half of the polymerase 
domain (roughly, amino acids 593-830) and that the RNA-dependent 5' nuclease and 
polymerase activities are not related to each other, and are controlled by different regions. 

25 

EXAMPLE 5 

Specific point mutants in Taq DN RX HT developed from information from the 

chimeric studies 

30 The chimeric studies (Example 3, above) suggest that the part of the TthPol 

sequence determining its high RNA-dependent 5' nuclease activity comprises the 
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BstBI-BamHI region located approximately between amino acid 382 and 593. 
Comparison of the amino acid sequences between the BstBI and BamHI regions of Tth 
DN RX HT and Taq DN RX HT (SEQ ID NOS: 165 and 164, respectively) revealed only 
25 differences (Figure 13). Among these, 12 amino acid changes were conservative 

5 while 13 of the differences resulted in a changes in charge. Since the analysis of the 

chimeric enzymes suggested that the critical mutations are located in both the BstBI-Ndel 
and the Ndel-BamHI regions of Tth DN RX HT, site specific mutagenesis was used to 
introduce the Tth DN RX HT specific amino acids into the BstBI-Ndel and Ndel-BamHI 
regions of the TaqTth(D-B) and the TaqTth(N-D) respectively. 

10 Six Tth DN RX HT specific substitutions were generated in the BstBI-Ndel 

region of the TaqTth(D-B) by single or double amino acid mutagenesis. Similarly, 12 
Tth DN RX HT specific amino acid changes were introduced at the homologous position 
of the Ndel-BamHI region of the TaqTth(N-D). 

Plasmid DNA was purified from 200 ml of JM109 overnight culture using 

15 QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, CA) according to the manufacturer's 
protocol to obtain enough starting material for all mutagenesis reactions. All site specific 
mutations were introduced using the Transformer Site Directed mutagenesis Kit 
(Clontech) according to the manufacturer's protocol; specific sequence information for 
the mutagenic primers used for each site is provided below. One of two different 

20 selection primers, Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, Palo 
Alto, CA catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions 
described. The selection ohgo used in a given reaction is dependent on the restriction site 
present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were Exoli strain JM109, 

25 

1. Construction of TaqTth(D-B) E404H (DNA sequence SEQ ID NO:83; 
amino acid sequence SEQ ID NO: 16) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
the mutagenic primer 240-60-01 5'-gag gag gcg ggg cac egg gcc gcc ctt-3' (SEQ ID 
30 NO:277) to introduce the E404H mutation. 
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2. Construction of TaqTth(D-B) F413H/A414R (DNA sequence SEQ ID 
NO:84; amino acid sequence SEQ ID NO: 17) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
the mutagenic primer 240-60-02 5'-ctt tec gag agg etc cat egg aac ctg tgg ggg agg-3' 
(SEQ ID NO:278) to introduce the F413H and the A414R mutations. 

3. Construction of TaqTth(D-B) W417L/G418K (DNA sequence SEQ ID 
NO:85; amino acid sequence SEQ ID NO:18) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
the mutagenic primer 240-60-03 5 '-etc ttc gee aac ctg ctt aag agg ctt gag ggg gag-3' 
(SEQ ID NO:279) to introduce the W417L and the G418K mutations. 

4. Construction of TaqTth(D-B) A439R (DNA sequence SEQ ID NO:86; 
amino acid sequence SEQ ID NO: 19) 

Site specific mutagenesis was performed on pTrc99A TaqTth(ND-B) DNA using 
the mutagenic primer 240-60-04 5 '-agg ccc ctt tec egg gtc ctg gcc cat-3' (SEQ ID 
NO:280) to introduce the A439R mutation. 

5. Construction of TaqTth(N-D) L4S1R (DNA sequence SEQ ID NO:87; 
amino acid sequence SEQ ID NO:20) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-05 5'-acg ggg gtg cgc egg gac gtg gcc tat-3' (SEQ ID NO:281) 
to introduce the L415 mutation. 

6. Construction of TaqTtli(N-D) R457Q (DNA sequence SEQ ID NO:88; 
amino acid sequence SEQ ID NO:21) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-06 5 '-gtg gcc tat etc cag gcc ttg tec ctg-3' (SEQ ID NO:282) to 
introduce the L415Q mutation. 
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?• Construction of TaqTth(N-D) V463L (DNA sequence SEQ ID NO:89; 
amino acid sequence SEQ ID NO:22) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-07 5'-ttg tec ctg gag ctt gcc gag gag atc-3' (SEQ ID NO:283) 
5 to introduce the V463L mutation. 

8. Construction of TaqTth(N-D) A468R (DNA sequence SEQ ID NO:90; 
amino acid sequence SEQ ID NO:23) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
10 mutagenic primer 240-60-08 5 '-gcc gag gag ate cgc cgc etc gag gcc-3' (SEQ ID NO:284) 
to introduce the A468R mutation. 

9- Construction of TaqTth(N"D) A472E (DNA sequence SEQ ID NO:91; 
amino acid sequence SEQ ID NO:24) 

15 Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 

mutagenic primer 240-60-09 5'-gcc cgc etc gag gag gag gtc ttc cgc-3' (SEQ ID NO:285) 
to introduce the A472E mutation. 

10. Construction of TaqTth(N-D) G499R (DNA sequence SEQ ID NO:92; 
20 amino acid sequence SEQ ID NO:25) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-10 5'-ttt gac gag eta agg ctt ccc gcc atc-3' (SEQ ID NO:286) to 
introduce the G499R mutation. 

25 11. Construction of TaqTth(N-D) E507Q (DNA sequence SEQ ID NO:93; 

amino acid sequence SEQ ID NO:26) 
Site specific mutagenesis was performed on pTrc99Ata(jTth(N-D) DNA using the 
mutagenic primer 276-046-04 5'-atc gcc aag acg caa aag acc ggc aag-3' (SEQ ID 
NO:287) to introduce the E507Q mutation. 

30 
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12. Construction of TaqTth(N-D) Y535H (DNA sequence SEQ ID NO:94; 
amino acid sequence SEQ ID NO:27) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-1 1 5'-aag ate ctg cag cac egg gag etc acc-3' (SEQ ID NO:288) 
to introduce the Y535H mutation. 

13. Construction of TaqTth(N-D) S543N (DNA sequence SEQ ID NO:95; 
amino acid sequence SEQ ID NO:28) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-12 5'-acc aag ctg aag aac acc tac att gac-3' (SEQ ID NO:289) 
to introduce the S543N mutation. 

14. Construction of TaqTth(N-D) I546V (DNA sequence SEQ ID NO :96; 
amino acid sequence SEQ ID NO:29) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-13 5'-aag age acc tac gtg gac ccc ttg ccg-3' (SEQ ID NO:290) 
to introduce the I546V mutation. 

15. Construction of TaqTth(N-D) D551S/I553V (DNA sequence SEQ ID 
NO:97; amino acid sequence SEQ ID NO:30) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-14 5'-att gac ccc ttg ccg age etc gtc cac ccc agg acg ggc-3' 
(SEQ ID NO:291) to introduce Ihe D551S and the I553V mutations. 

16. Construction of TaqDN RX HT W417L/G418K/E507Q (DNA 
sequence SEQ ID NO:98; amino acid sequence SEQ ID NO:31) 

The TaqDN RX HT W417L/G41 8K/E507Q triple mutant was made by. 
combining the TaqTth(D-B)W417L/G418K with the TaqTth(N-D) E507Q. 
TaqTth(D-B)W417L/G418K was cut with the restriction enzymes Ndel and BamHI, and 
the larger, vector firagment was isolated as detailed in Example 3. The TaqTth(N-D) 
E507Q construct was also cut with Ndel and BamHI and the smaller (approximately 795 
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base pairs) fragment was gel isolated and purified as detailed in Example 3. The 
Ndel-BamHI insert was ligated into the gel purified vector, as detailed in Example 3. 

17. Construction of TaqDN RX HT W417L/E507Q (DNA sequence SEQ 
5 ID NO:99; amino acid sequence SEQ ID NO:32) 

Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 
primer 337-01-02: 5'-TTC GCC AAC CTG CTT GGG AGG CTT GAG GGG GAG -3' 
(SEQ ID NO:292) was used in a site specific mutagenesis reaction to change the K at 
amino acid position 418 back to the wild-type amino acid, G. Site specific mutagenesis 
10 was done using the Transformer Site Directed Mutagenesis Kit (Clonetech) according to 
the manufacturer's instructions, and as described in Experimental Example 4. 

18. Construction of TaqDN RX HT G418K/E507Q (DNA sequence SEQ 
ID NO:100; amino acid sequence SEQ ID NO:33) 

15 Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 

primer 337-01-01: 5'-CTC TTC GCC AAC CTG TGG AAG AGG CTT GAG GGG -3' 
(SEQ ID NO:293) was used in a site specific mutagenesis reaction to change the L at 
amino acid position 417 back to the wild-type amino acid, W. Site specific mutagenesis 
was done using the Transformer Site Directed Mutagenesis Kit (Clonetech) according to 

20 the manufacturer's instructions, and as described in Experimental Example 4. 

Expression and purification of mutant proteins was done as detailed in Example 3, 
and the cleavage activities of these proteins were characterized as describe in Example 1, 
part A, A comparison of the cleavage cycling rates of a selection of these mutant 
proteins on an RNA target is shown in Figure 14. As further discussed in the Description 

25 of the Invention, these data show that amino acids in the regions 41 7/418 and amino acid 
507 are important in the conferring the TthPol-like RNA-dependent cleavage activity on 
the chimerical proteins comprising portions of TaqPol in combination with portions of 
TthPol that are not independently capable of providing enhanced RNA dependent activity 
(i.e., the D-B and N-D portions of Tth). As described in the Description of the Invention, 

30 Taq DN RX HT variant carrying only the W417L, G418K and E507Q substitutions were 
created. By comparing their cleavage rates to that of Tth DN RX HT on the IL-6 RNA 
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substrate as described in Example 1, these mutations were determined to be sufficient to 
increase the Taq DN RX HT activity to the Tth DN RX HT level Figure 15 shows that 
the Taq DN RX HT W417L/G418K/E507Q and Taq DN RX HT G418K/E507Q mutants 
have 1 A times higher activity than Tth DN RX HT and more than 4 fold higher activity 
5 than Taq DN RX HT, whereas the Taq DN RX HT W41 7L/E507Q mutant has the same 
activity as the enzyme, which is about 3 fold higher than Taq DN RX HT. These results 
demonstrate that K418 and Q507 of TthPol are particularly important amino acids in 
providing RNA dependent 5' nuclease activity that is enhanced compared to TaqPol. 

10 EXAMPLE 6 

RNA-dependent 5' nuclease properties of the Taq DN RX HT G418K/E507Q 5' 
nuclease are similar to Tth DN RX HT with respect to salt and temperature optima 

To determine if the G418K/E507Q mutations caused any significant changes in 

15 the properties of the Taq DN RX HT mutant in addition to the increased cleavage rate 
with the RNA target, the Taq DN RX HT G418K/E507Q (SEQ ID NO:33), Taq DN RX 
HT (SEQ ID N0276), and Tth DN RX HT (SEQ ID NO:274) enzymes were compared in 
the RNA template dependent 5' nuclease assay under conditions where temperature and 
the concentrations of salt and divalent ions were varied. The upstream DNA and the 

20 template RNA strands of the substrate used in this study were linked into a single IrT 
molecule (SEQ ID NO: 166) as shown in Figure 20 A, and the labeled downstream probe 
(SEQ ID NO: 167) was present in large excess. The 5* end of the target RNA strand was 
blocked with a biotin-streptavidin complex to prevent any non-specific degradation by 
the enzyme during the reaction (Lyamichev et al. Science 260:778 [1993], Johnson et 

25 al. Science 269:238 [1995]). The cleavage rates for Taq DN RX HT G418K/E507Q, 
Taq DN RX HT, and Tth DN RX HT are plotted as functions of temperature in Figure 
20B. The closed circles represent enzyme Taq DN RX HT, the open circles represent 
enzyme Tth DN RX HT, and the Xs represent enzyme Taq DN RX HT G418K/E507Q. 
The difference in the activities of Tth and Taq DN RX HT enzymes with the IrT substrate 

30 is even greater than the difference found with the IL-6 RNA substrate when tested in a 
cleavage assay as described in Example 1. The G418K/E507Q mutations increase the 
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activity of the Taq enzyme more than tenfold and by 25% compared with the Tth 
enzyme. All three enzymes show a typical temperature profile of the invasive signal 
amplification reaction and have the same optimal temperature. No significant effect of 
G418KyE507Q mutations on DNA dependent 5' nuclease activity of Taq DN RX HT with 
5 the all-DNA substrate analogous to IrT (SEQ ID NO: 168) under the same conditions was 
found. 

The effects of KCl and MgS04 concentrations on the 5' nuclease activity of Taq 
DN RX HT G418K/E507Q, Taq DN RX HT, and Tth DN RX HT with the IrT substrate 
are shown in Figure 20C and D. The activities of all enzymes have similar salt 

10 dependencies with an optimal KCl concentration of 1 00 mM for Taq DN RX HT 
G418K/E507Q and Tth DN RX HT and 50 mM for Taq DN RX HT. The optimal 
MgS04 concentration for all enzymes is approximately 8 mM. The analysis of the data 
presented in Figure 20 suggests that the properties of Taq DN RX HT G418K/E507Q are 
much closer to those of Tth DN RX HT rather than Taq DN RX HT confirming the key 

15 role of the G418K/E507Q mutations in the recognition of the substrate with an RNA 
target. 

To understand the mechanism of the reduction of the 5' nuclease activity in the 
presence of an RNA versus a DNA target, the Michaelis constant (Km) and the maximal 
catalytic rate (kcat) of all three enzymes were determined, using an excess of the IrT 

20 substrate (SEQ ID NO: 166) and the downstream probe (SEQ ID NO: 167) and a limiting 
enzyme concentration. For these measurements, ten-jil reactions were assembled 
containing 10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 |ag/ml 
tRNA, 4 mM MgCl2, 1 nM of enzyme (Taq DN RX HT, Tth DN RX HT, or Taq DN RX 
HT G418K/E507Q) and different concentrations (0.125, 0.25, 0.5 or 1 ixM) of an 

25 equimolar mixture of the IrT target and the downstream probe. The cleavage kinetics for 
each enzyme and each substrate concentration were measured at 46°C. Reactions were 
stopped by the addition of 10 |j-l of 95% formamide containing 10 mM EDTA and 0.02% 
methyl violet (Sigma). One jil of each stopped reaction digest was fi:-actionated on a 20% 
denaturing acrylamide gel (19:1 cross-linked), with 7M urea, and in a buffer of 45 mM 

30 Tris-borate, pH 83, 1 .4mM EDTA. Gels were scanned on an FMBIO- 1 00 fluorescent 
gel scanner (Hitachi) using a 585 nm filter. The fraction of cleaved product (determined 
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from intensities of bands corresponding to uncut and cut substrate with FMBIO Analysis 
software, version 6.0, Hitachi) was plotted as a function of reaction time. The initial 
cleavage rates were determined from the slopes of linear part of the cleavage kinetics and 
were defined as the concentration of cut product divided by the enzyme concentration 

5 and the time of the reaction (in minutes). The Michaelis constant and the maximal 
catalytic rate hzx of each enzyme with IrT substrate were determined from the plots of the 
initial cleavage rate as fimctions of the substrate concentration. 

It was found that all three enzymes have similar values (in the range of 
200-300 nM) and kcat values of approximately 4 min ^ for Taq DN RX HT and Tth DN 

10 RX HT and of 9 min^ for Taq DN RX HT G418K/E507Q. That the G418K/E507Q 

mutations increase the kcat of Taq DN RX HT more than two fold, but have little effect on 
Km suggest that the mutations position the substrate in an orientation more appropriate for 
cleavage, rather than simply increase the binding constant. 

15 EXAMPLE 7 

Use of molecular modeling to further improve RNA-dependent 5' nuclease activity 

A, Point mutants 

To develop enzymes with altered fimction, sequence changes were introduced by 
20 site-specific mutagenesis in predetermined locations or by random mutagenesis. 

Locations for site-specific mutagenesis were chosen based on evidence from chimeric 
studies, relevant published literature, and molecular modeling. Seven additional mutant 
enzymes were developed from the Tth DN RX HT enzyme, and twenty additional mutant 
enzymes were developed from the Taq DN RX HT enzyme, both discussed previously. 
25 Some of the mutant enzymes are the result of multiple mutagenesis reactions, that is, 

more than one change has been introduced to obtain the final product. Mutation reactions 
were done using the Tth DN RX HT construct (SEQ ID NO:273) described in Example 
2C2, or the Taq DN RX HT construct (SEQ ID NO:275), described in Example 2C1 
unless otherwise stated. Plasmid DNA was purified from 200 ml of JM109 overnight 
30 culture using QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, CA) according to the 
manufacturer's protocol to obtain enough starting material for all mutagenesis reactions. 
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All site-specific mutations were introduced using the Transformer Site Directed 
mutagenesis Kit (Clontech) according to the manufacturer's protocol. One of two 
different selection primers, Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI 
(Clontech, Palo Alto CA catalog #6488-1 or catalog #6373-1) was used for all 

5 mutagenesis reactions described. The selection oligo used in a given reaction is 

dependent on the restriction site present in the vector. All mutagenic primers for both the 
site-specific mutagenesis and the random mutagenesis were synthesized by standard 
synthetic chemistry. Resultant colonies for both types of reactions were E,coli strain 
JM109. Random mutagenesis methods are described below. 

10 Mutants were tested via the rapid screening protocol detailed in Example 1 . 

Then, if more detailed analysis was desired, or if a larger protein preparation was 
required, expression and purification of mutant proteins was done as detailed in Example 
3. 

15 1. Construction of Tth DN RX HT H641A, Tth DN RX HT H748A, Tth 

DN RX HT H786A 

Site specific mutagenesis was performed on pTrc99A Tth DN RX HT DNA using 
the mutagenic primer 583-001-02: 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc ctc-3' (SEQ 
ID NO:294) to introduce the H641 A mutation (DNA sequence SEQ ID NOrlOl; amino 

20 acid sequence SEQ ID NO:34), or the mutagenic primer 583-001-03: 5' cat gtt gaa ggc 
cat ggc etc cgc ggc etc cct-3' (SEQ ID NO:295) to generate the H748A mutant (DNA 
sequence SEQ ID NO:102; amino acid sequence SEQ ID NO:35), or the mutagenic 
primer 583-001-04: 5'-cag gag gag etc gtt ggc gac ctg gag gag-3' (SEQ ID NO:296) to 
generate the H786A mutant enzyme (DNA sequence SEQ ID NO: 103; amino acid 

25 sequence SEQ ID NO:36). 

2. Construction of Tth DN RX HT (H786A/G506K/Q509K) 
Starting with the mutant Tth DN RX HT H786A, generated above, site specific 

mutagenesis was done using the mutagenic primer 604-022-02: 5*-gga gcg ctt gcc tgt ctt 
ctt cgt ctt ctt caa ggc ggg agg cct-3' (SEQ ID NO:297) to generate this variant termed 
30 "TthAKK", (DNA sequence SEQ ID NO: 104; amino acid sequence SEQ ID NO:37), 

3. Construction of Taq DN RX HT (W417L/G418K/E507Q/H784A) 



209 



Mutagenic oligonucleotide 158-029-02: 5 '-gag gac cag etc gtt ggc gac ctg aag gag 
cat-3' (SEQ ID NO:298) was used in a site specific mutagenesis reaction to introduce the 
H784A mutation and generate this construct termed "Taq4M" (DNA sequence SEQ ID 
NO:105; amino acid sequence SEQ ID NO:38). 

4. Construction of Taq4M H639A, Taq4M R587A, Taq4M G504K and 
Taq4M G80E 

Site specific mutagenesis was done on the Taq4M mutant, using primer 
473-010-11: 5'-gaggggcgggacatcgccacggagaccgccagc-3' (SEQ ID NO:299) to generate 
the Taq 4M H639A mutant (DNA sequence SEQ ID NO: 106; amino acid sequence SEQ 
ID NO:39), primer 473-010-10: 5'-cag aac ate ccc gtc gcc acc ccg ctt ggg cag-3' (SEQ 
ID NO:300) to generate Taq 4M R587A (DNA sequence SEQ ID NO:107; amino acid 
sequence SEQ ID NO:40), primer 300-081-06: 5 '-ggg ctt ccc gcc ate aag aag acg gag aag 
acc-3' (SEQ ID NO:301) to generate Taq 4M G504K (DNA sequence SEQ ID NO: 108; 
amino acid sequence SEQ ID N0:41), and primer 330-088-04: 5 '-eta ggg ctt ccc gcc ate 
aag aag acg caa aag acc ggc-3' (SEQ ID NO:302) to generate the Taq 4M G80E mutant 
(DNA sequence SEQ ID NO: 109; amino acid sequence SEQ ID NO:42). 

5. Construction of Taq 4M P88E/P90E and Taq 4M L109F/A1 lOT 
Starting with Taq 4M described above, site specific mutagenesis was done using 

primer 473-087-03: 5'-ccg ggg aaa gtc etc etc cgt etc ggc ccg gcc cgc ctt-3' (SEQ ID 
NO:303) to generate the P88E/P90B mutations (DNA sequence SEQ ID NO:l 10; amino 
acid sequence SEQ ID NO:43), or primer 473-087-05: 5'-cgg gac etc gag gcg cgt gaa ccc 
cag gag gtc cac-3' (SEQ ID NO:304) to generate the L109F/A1 lOT mutations (DNA 
sequence SEQ ID N0:1 1 1; amino acid sequence SEQ ID NO:44). 

6. Construction of Taq DN RX HT 
(W417L/G418K/G499R/A502K/I503L /G504K/ E507K/H784A) 

Two PGR reactions were performed, first using construct Taq4M (Taq 
W417L/G418K/G504K/E507Q/H784A) as a template. Using primers 158-84-01 
5'-CTCCTCCACGAGTTCGGC-3' (SEQ ID NO:305) and 535-33-02 5'-ACC GGT 
CTT CTT CGT CTT CTT CAA CTT GGG AAG CCT GAG CTC GTC AAA-3' (SEQ 
ID NO:306) a 620 base pair PGR fragment was generated. Another 510 base pair PGR 
product was generated using primer 535-33-01 5'-AAG ACG AAG AAG ACC GGT 
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AAG CGC TCC ACC AGC-3' (SEQ ID NO:307) and 330-06-03 5'-GTC GAC TCT 
AGA TCA GTG GTG GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3' 
(SEQ ID NO:308). The two PCR products overlap such that a final recombinant PCR 
amplification was done using the outside primers 158-84-01 and 330-06-03 to yield the 

5 1 1 82 base pair product. The recombinant PCR product was digested with the restriction 
enzymes NotI and BamHI according to the manufacturer's instructions to yield a 793 
base pair fragment. The parent plasmid Taq4M was also digested with the same enzymes 
and used as the vector for Ugation. All DNA fragments were TAE agarose gel purified 
prior to ligation. The fragment was ligated into the vector, and transformed into JM109 

10 cells, thus incorporating the mutations G499R, A502K, I503L, and E507K as well as the 
restriction endonuclease site, Agel. This construct is termed "Taq 8M" (DNA sequence 
SEQ ID NO: 1 12; amino acid sequence SEQ ID NO:45). 

B. Random Mutagenesis 

15 Numerous enzymes with altered fimction were generated via random 

mutagenesis. The regions of the protein targeted for random mutagenesis were chosen 
based on molecular modeling data and from information in the literatxure. Different 
mutagenic primers were used to introduce mutations into different regions of the protein. 
Random mutagenesis was performed on the Taq variant Taq 4M G504K (Taq DN RX 

20 HT W417L/G418K/G504K/E507Q/H784A/) (SEQ ID NO:108) described above and 
mutant PCR fragments generated in the mutagenesis reaction were exchanged for 
homologous regions in TaqSM (SEQ ID N0:112) unless otherwise stated. 

Random mutagenesis was also performed on the Tth DN RX HT H786A (SEQ ID 
NO: 103) described above. Mutant PCR fragments generated with the Tth DN RX HT 

25 H786A template were exchanged for homologous regions in the unaltered Tth DN RX 
HT H786A. 



1. Random mutants in amino acid residues 500-507 or 513-520 

The first mutagenic oligonucleotide, 535-054-01: 5'-gga gcg ctt acc ggt ctt (ttg 
30 cgt ctt ctt gat ctt ggg aag) cct tag etc gtc aaa gag-3 ' (SEQ ID NO:309) was used in 

conjunction with 158-84-01: 5'-CTC CTC CAC GAG TTC GGC-3' (SEQ ED NO:310) to 



211 



install random residues from amino acid position 500 to 507 of Taq polymerase variant 
Taq DN RX HT W417L/G418K/G504KyE507Q/H784A (SEQ ID NO: 108). This was 
accomplished by synthesizing the primer 535-054-01 such that only 91% of the bases 
within the parenthesis are unaltered while the remaining 9% of the bases are an equal 
mixture of the other 3 nucleotides. The initial, unaltered sequence of this oligo includes 
the G499R, A502K and the Q507K changes. 

To generate mutations in the region 500-507, primer 535-054-01 and primer 
158-84-01 were used in a PGR reaction, using the Advantage cDNA PGR kit (Glonetech) 
and Taq variant described above, as the target. This PGR fragments was then run on a 
1% TAB agarose gel, excised and purified with QIAquick Gel Extraction Kit (Qiagen, 
Valencia GA, catalog # 28706). The purified fragment was cut with NotI and Agel and 
ligated into pTaq8M that had been linearized with NotI and Agel. JM109 E.coli cells 
(Promega) were transformed with the ligated products. Glones were tested as described 
below. 

The second mutagenic oligonucleotide (used in a separate reaction) 535-054-02: 
5'-caa aag acc ggt aag cgc (tec acc age gcc gcc gtc ctg gag) gcc etc cgc gag gcc cac-3' 
(SEQ ID N0:311) was used in conjunction with 330-06-03: 5'-GTG GAG TGT AGA 
TGA GTG GTG GTG GTG GTG GTG GTT GGG GGC CGG GGG GAT G-3' (SEQ ID 
NO:312) to install random residues from amino acid 513-520. The bases within the 
parenthesis of primer 535-054-02 are also 91% wild-type and 3% each of the other 3 
nucleotides. 

To generate mutations in the region 513-520, primer 535-054-02 and primer 
535-054-02 were used in a PGR reaction with Taq DN RX HT W417L/G418K/G504K/ 
E507Q/H784A (SEQ ID NO: 108) as template, as described above. The resulting PGR 
fragment was purified as above and cut with the restriction enzymes Agel and BamHI. 
The cut fragment was then ligated mto the Taq8M construct, also linearized with Agel 
and BamHI. JM109 E.coli cells were transformed with the ligated products. Glones 
were tested as described Example 1. Mutants developed from these include: 

Taq DN RX HT W417L/G418K/G499R/A502K/K504N/E507iaH784A (Ml-13) (DNA 
sequence SEQ ID N0:1 13; amino acid sequence SEQ ID NO:46). 
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Taq DN RX HT W417L/G418K/G499R/L500I/A502K/G504KyQ507Hy«784A (Ml-36) 
(DNA sequence SEQ ID NO: 114; amino acid sequence SEQ ID NO:47). 
Taq DN RX HT W417L/G418K/G499RyA502K/I503L/G504K/E507KyT514S/H784A 
(M2-24) (DNA sequence SEQ ID NO: 115; amino acid sequence SEQ ID NO:48). 
5 Taq DN RX HT W417L/G418K/G499R/A502K/I503L/G504KyE507K/ V518L/H784A 
(M2-06) (DNA sequence SEQ ID N0:1 16; amino acid sequence SEQ ID NO:49). 
2. TthDN RX HT H786A random mutagenesis 
To generate mutants in the helix-hairpin-helix region of the TthDN RX HT 
H786A (SEQ ID NO:36) enzyme, two different PGR reactions were performed using the 
10 H786A (SEQ ID NO: 103) mutant as a template. The two PGR products overlap such that 
a recombinant PGR reaction can be performed (Higuchi, in PGR Technology, H. A. 
Erlich, ed., Stockton Press, New York. pp61-70 [1989]). This final PGR product is then 
exchanged with the homologous region of the TthDN H786A mutant by using restriction 
enzyme sites located on the ends of the fragment and within the TthDN H786A sequence. 
15 Starting with TthDN H786A discussed above, and using primer 604-08-06: 5 '-gtc 

gga ggg gtc ccc cac gag-3' (SEQ ID NO:313) and primer 390-76-08: 5'-tgt gga att gtg 
age gg (SEQ ID NO:314), a 620 base pair PGR fragment was generated. PGR reactions 
were performed using the Advantage cDNA PGR kit (Glontech) according to 
manufacturer's instructions. This PGRproduct includes amino acids 1-194. No 
20 mutations were introduced via this reaction, however the restriction enzyme site EcoRI is 
present at the 5' end. 

Starting with TthDN RX HT H786A discussed above, and using mutagenic 
primer 604-08-05: 5 '-etc gtg ggg gac ccc tec gac aac etc (ccc ggg gtc aag ggc ate ggg 
gag aag acc gee) etc aag ctt etc aag-3' (SEQ ID NO:315) and primer 209-74-02: 5'-gtg 
25 gcc tec ata tgg gcc agg ac-3' (SEQ ID N0:3 16) a 787 base pair PGR fragment was 
generated. PGR reactions were done as above. This fragment does contain random 
mutations, due to the presence of the mutagenic primer, 604-08-05. The bases within the 
parenthesis of this primer were synthesized such that 91% of the sequence is wild-type, 
while the additional 9% is evenly divided between the remaining 3 bases. 
30 The two PGR fragments overlap, and were combined in a recombinant PGR 

reaction. Primers 390-76-08 and 209-74-02 were added, and the Advantage cDNA PGR 
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kit (Clontech) was again used according to manufacturer's instructions. A 1380 base pair 
product was generated from this reaction. 

The recombinant PGR product was cut with the restriction enzymes EcoRI and 
NotI according to the manufacturer's instructions to yield a 986 base pair fragment. 
5 TthDN RX HT H786A was prepared by cutting with the same enzymes. The fragment 
was then ligated into the vector, and transformed into JM109 cells. New mutants 
developed from this set of reactions include: 

TthDN RX HT H786A/P197R/K200R (DNA sequence SEQ ID NO: 117; amino acid 
10 sequence SEQ ID NO:50). 

TthDN RX HT H786A/K205Y (DNA sequence SEQ ID N0:1 18; amino acid sequence 
SEQIDN0:51). 

TthDN RX HT H786A/G203R (DNA sequence SEQ ID N0:1 19; amino acid sequence 
SEQ ID NO:52). 

15 3. Construction of Taq DN RX HT W417L/G418K7H784A 

L109F/A110T/G499R/A502K/I503L/G504K/E507K/T514S (Taq SS) 

Starting with Taq DN RX HT 
W417L/G418K/G499R/A502Kyi503L/G504Km507K/T514S/H784A (SEQ ID N0:115) 

mutant described above, primer 473-087-05: 5 '-egg gac etc gag geg cgt gaa cec cag gag 
20 gtc cac-3 ' (SEQ ID N0:3 1 7) was used in conjunction with the appropriate selection 

primer in a site specific mutagenesis reaction to incorporate the L109F and Al lOT 

mutations to generate this enzyme, termed "TaqSS" (DNA sequence SEQ ID NO: 120; 

amino acid sequence SEQ ID NO:53). 

4. Construction of Taq DN RX HT W417L/G418L/H784A 
25 P88E/P90E/G499R/A502K/I503L/G504K/E507K/T514S 

Starting with Taq DN RX HT 
W417L/G418K/G499RyA502K/I503L/G504K/E507K/T514S/H784A (SEQ ID N0:115) 
mutant described above, primer 473-087-03: 5'-ceg ggg aaa gtc etc etc cgt etc ggc ccg 
gcc cgc ctt-3' (SEQ ID NO:318) was used in conjunction with the appropriate selection 
30 primer in a site specific mutagenesis reaction to incorporate the P88E and P90E 
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mutations to generate this enzyme (DNA sequence SEQ ID NO: 121; amino acid 
sequence SEQ ID NO:54). 

5, TaqSS random mutagenesis 

Random mutagenesis was used to introduce additional changes in the 
5 helix-hairpin-helix domain of the TaqSS mutant (SEQ ID NO: 120), The mutagenesis 
was done as described in example 9 above. In the first step, two different but overlapping 
PGR products were generated. One of the PGR products, generated with oligos 
390-76-08 (SEQ ID NO:314), and 604-08-04: 5'-gtc ggactc gtc acc ggt cag ggc-3' (SEQ 
ID NO:319) incorporates the EcoRI site into the fragment, but does not incorporate any 
10 mutations. The second PGR product utihzes mutagenic primer 604-08-03: 5'-ctg acc ggt 
gac gag tec gac aac ctt (ccc ggg gtc aag ggc ate ggg gag aag acg gcg) agg aag ctt ctg 
gag-3' (SEQ ID NO:320) and primer 209-74-02 (SEQ ID NO:316). This fragment 
contains random point mutations, and when combined via recombinant PGR with the first 
fragment, can be cut with the restriction enzymes EcoRI and NotI, and ligated into the 
15 TaqSS construct, also cut with EcoRI and Notl. The hgated construct was then 
transformed into JM109, Golonies were screened as described below. Enzymes 
developed from this mutagenesis include: 

TaqSS K198N (DNA sequence SEQ ID N0:1 12; amino acid sequence SEQ ID NO:55), 
20 TaqSS A205Q (DNA sequence SEQ ID NO: 123; amino acid sequence SEQ ID NO:56). 
TaqSS I200M/A205G (DNA sequence SEQ ID NO: 124; amino acid sequence SEQ ID 
NO:57). 

TaqSS K203N (DNA sequence SEQ ID NO: 125; amino acid sequence SEQ ID NO:58). 
TaqSS T204P (DNA sequence SEQ ID NO: 126; amino acid sequence SEQ ID NO:59). 

25 6. Construction of TaqSS R677A 

To generate enzymes with sequence changes in both the arch region and in the 
polymerase region, additional specific point mutations were generated in TaqSS. Site 
specific mutagenesis was performed as described above using the oligo 473-060-10: 
5'-tag etc ctg gga gag ggc gtg ggc cga cat gcc-3' (SEQ ID NO:321) to generate the 

30 TaqSS R677A mutant (DNA sequence SEQ ID NO:127; amino acid sequence SEQ ID 
NO:60), 
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7. Construction of TaqTthAKK (DNA sequence SEQ ID NO:128; amino 
acid sequence SEQ ID NO:61) and TthTaqSM (DNA sequence SEQ ID NO:129; 
amino acid sequence SEQ ID NO:62) 

Chimeric mutant TaqTthAKK and TthTaqSM were generated by cutting Tth DN 
5 RX HT (H786A/G506K/Q509K) (SEQ ID NO:104; here abbreviated TthAKK) or Taq 
4M G504 (SEQ ID NO:108; here abbreviated Taq 5M) with the restriction endonucleases 
EcoRI and Notl. The smaller insert fragments as well as the larger vector fragments were 
gel purified as detailed in Example 3D, and the insert fragments were exchanged between 
the two mutants and ligated as described in Example 3D. Screening and verification of 
10 the construct sequence was also done as in Example 3D. 

EXAMPLE 8 

Improvement of RNA-dependent 5' nuclease activity in other polymerases 

15 Information gained from the TaqPol/TthPol recombinations, mutagenesis and 

modeling, was used to make comparable mutations in additional DNA polymerases and 
examined the effects on the cleavage activities of these enzymes. The DNA polymerases 
of Thermus filiformus (TfiPol) and Thermus scotoductus (TscPol) were cloned and 
purified as described in Example 2. The mutagenesis of these two proteins is described 

20 below. 

A, Construction of TfiPolDN2M 

Mutagenesis of pTrc99a-TfiPol (SEQ ID NO:249) was done using the 
QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 
protocol. The P420K mutation was made with the following two oHgonucleotides; 

25 5'-CTTCCAGAACCTCTTTAAACGGCTTTCCGAGAAG (SEQ ID NO:322) and 
5'-CTTCTCGGAAAGCCGTTTAAAGAGGTTCTGGAAG (SEQ ID NO:323). The 
E507Q mutation was made with the following two oligonucleotides; 
5'-CCGGTGGGCCGGACGCAGAAGACGGGCAAGC (SEQ ID NO:324) and 
5'-GCTTGCCCGTCTTCTGCGTCCGGCCCACCGG (SEQ ID NO:325). The D785N 

30 mutation was made with the following two oHgonucleotides; 

5'-CTCCTCCAAGTGCACAACGAGCTGGTCCTGG (SEQ ID NO:326) and 
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5'-CCAGGACCAGCTCGTTGTGCACTTGGAGGAG (SEQ ID NO:327). The plasmid 
containing all three mutations is called pTrc99a-TfiPolDN2M, (DNA sequence SEQ ID 
NO:130; amino acid sequence SEQ ID NO:63). 
B. Construction of TscPolDN2M 
5 Mutagenesis of pTrc99a-TscPol (SEQ ID NO:253) was done with the 

QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 
protocol. The E416K mutation was made with the following two oligonucleotides; 
5'-GCCGCCCTCCTGAAGCGGCTTAAGGG (SEQ ID NO:328) and 
5'-CCCTTAAGCCGCTTCAGGAGGGCGGC (SEQ ID NO:329). The E505Q mutation 
10 was made with the following two oligonucleotides; 

5'-ATCGGCAAGACGCAGAAGACGGGCAAGC (SEQ ID NO:330) and 
5'-GCTTGCCCGTCTTCTGCGTCTTGCCGAT (SEQ ID NO:33 1). The D783N 
mutation was made with the following two oUgonucleotides; 
5'-TTGCAGGTGCACAACGAACTGGTCCTC (SEQ ID NO:332) and 
15 5'-GAGGACCAGTTCGTTGTGCACCTGCAA(SEQIDNO:333). The plasmid 

containing all three mutations is called pTrc99a-TscPolDN2M, (DNA sequence SEQ ID 

N0:131; amino acid sequence SEQ ID NO:64). 

C. Chimerics of Tsc, Tfl, Tth and Taq mutants 

1. Construction of TfiTth AKK (DNA sequence SEQ ID NO:132; amino 
20 acid sequence SEQ ID NO:65), TscTthAKK (DNA sequence SEQ ID 

NO:I33; amino acid sequence SEQ ID NO:66), TfiTaqSM (DNA 
sequence SEQ ID NO:134; amino acid sequence SEQ ID NO:67) and 
TscTaqSM (DNA sequence SEQ ID NO: 135; amino acid sequence 
SEQ ID NO:68) 

25 To generate chimeric enzymes between Tth DN RX HT (H86A/G506K/Q509K) 

(here abbreviated TthAKK, SEQ ID NO: 104) or Taq 4M G504 (here abbreviated Taq 
5M, SEQ ID NO:108), and Tfi DN 2M (SEQ ID NO:130), or Tsc DN 2M (SEQ ID 
NO: 131), additional restriction endonuclease sites were introduced by site specific 
mutagenesis into the named Tfi and Tsc mutants. Mutagenic primers 700-01 1-01 5'-cag 

30 acc atg aat tec acc cca ctt ttt gac ctg gag-3' (SEQ ID NO:334) and 700-01 1-02 5'-gtg gac 
gcg gcc gcc cga ggc cgc cgc cag ggc cag-3' (SEQ ID NO:335) were used to introduce an 
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EcoRI site at amino acid position 1 and a NotI site at amino acid position 33 1 in Tfi DN 
2M. Mutagenic primers 700-01 1-03 5'-cag acc atg aat tec ctg ccc etc ttt gag ccc aag-3' 
(SEQ ID NO:336) and 700-01 1-04 5'-gta aac cgc gcc gcc cca ggc ggc gge caa ggc gtt-3' 
(SEQ ID NO:337) were used to introduce an EcoRI site at amino acid position 1 and a 

5 NotI site at amino acid position 327 in Tsc DN 2M. PGR reactions were done using the 
Advance cDNA PGR kit (Glonetech) according to manufacturer's instructions and either 
Tfi DN 2M or Tsc DN 2M as target, with their corresponding primers. The 1017 base 
pair PGR products were cut with both EcoRI and NotI to yield 993 base pair insert 
fragments that were gel purified as described in Example 3D. The mutants Taq4M 

10 G504K (SEQ ID NO:108) and Tth DN RX HT (H786A/G506K/Q509K) (SEQ ID 
NO: 104) were also cut with EcoRI and NotI, and the larger, vector firagment was gel 
isolated as above. Ligations were performed as detailed in Example 3D, as was the 
screening and verification of the new constructs. 

15 EXAMPLE 9 

Additional Enzymes having Improved RNA-dependent 5' nuclease activity 



Generation of Tfi DN 2M(AN) 

20 To facilitate later cloning steps, an endogenous restriction en2yme site (Not I) was 

removed from the polymerase region of the TfiPolDN2M gene (SEQ ID NO: 130 
described in Example 8A), and a unique Not I site was inserted in a more advantageous 
position. 

The endogeneous Not I site was removed as follows. The QUIKGHANGE 
25 Site-Directed Mutagenesis Kit from (Stratagene) was used according to manufacturer's 
instructions with the mutagenic primers 5'-gag-gtg-gag-cgg-ccc-ctc-tcc-cgg-gtc-ttg (SEQ 
ED NO: 338) and 5'-caa-gac-ccg-gga-gag-ggg-ccg-ctc-cac-ctc (SEQ ID NO: 339). The 
new construct was named Tfi DN 2M(AN) (DNA sequence SEQ ID NO: 340; amino acid 
sequence SEQ ID NO: 341). 
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Generation of TH DN 2M(N), Tsc DN 2M(N) 

To install a unique NotI site (at amino acid position 328) in Tfi DN 2M(AN) 
(SEQ ID NO: 340), primers 886-088-07 (SEQ ID NO: 342) 5'-tgg-cgg-cgg-cct-cgg-gcg- 
gcc-gcg-tcc-acc-ggg-caa-ca-3' and 700-010-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc (SEQ 

5 ID NO: 343) were used in a PGR reaction with Tfi DN 2M(AN) (SEQ ID NO: 340) as 
template. The resulting PGR fragment was purified and cut with the restriction enzymes 
NotI and Sail. The cut fragment was then hgated into the TfiTthAKK (SEQ ID NO: 132, 
described in example 8,C) construct which was also digested with NotI and Sail. The 
new construct was termed Tfi DN 2M(N) (DNA sequence SEQ ID NO: 345; amino acid 

10 sequence SEQ ID NO: 346). 

To introduce a Not I restriction endonuclease site into mutant TscPol DN 2M 
(previously described in Example 8B above), PGR was performed with primers: 886-088- 
05 (SEQ ID NO: 344) 5'-tgg-ccg-ccg-cct-ggg-gcg-gcc-gcg-ttt-acc-ggg-cgg-ag-3' and 
15 700-010-03 (SEQ ID NO: 343) 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' using TscPolDN2M 
(SEQ ID NO: 131) as template. The Notl-Sall digested fragment of the PGR product was 
then sub-cloned into NotI and Sail digested TscTthAKK vector (SEQ ID NO: 133). The 
resulting construct was termed Tsc DN 2M(N) (DNA sequence SEQ ID NO:347; amino 
acid sequence SEQ ID NO:348). 

20 

Generation of Tfi DN 2M(N)AKK and Tsc DN2M(N)AKK 

To generate the "AKK" set of mutations (G504K/E507K7H784A/D785N), site 
specific mutagenesis was done on the Tfi DN 2M(N) mutant (DNA sequence SEQ ID 
NO: 345), using primers 959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag-cgg-acg-aag-aag- 
25 acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt-gag-gcc-3', 5'-ctc- 
ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3', 5'-cag-gac-cag-ctc-gtt-ggc-cac-ttg-gag-gag- 
gag-3' (SEQ ID NO: 354-357) to generate the Tfi 2M(N)AKK mutant (DNA sequence 
SEQ ID NO: 358; amino acid sequence SEQ ID NO: 359). This construct is termed 
"TfiAKK". 

30 To install the "AKK" set of mutations (G502K/E505K/H782A/D783N), into the 

TscDN 2M (N) construct, (DNA sequence SEQ ID NO: 347) primers 959-022-05 to -08: 
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5'-ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag-aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt- 
ctt-cgt-ctt-ctt-gat-ggc-ggg-aag-ccc-3', 5'-atg-ctt-ttg-cag-gtg-gcc-aac-gaa-ctg-gtc-ctc-3', 
5'-gag-gac-cag-ttc-gtt-ggc-cac-ctg-caa-aag-cat-3' (SEQ ID NO: 360-363) were used to 
generate Tsc2M(N)AKK (DNA sequence SEQ ED NO: 364; amino acid sequence SEQ 
5 ID NO: 365). This construct is termed "Tsc AKK". 

Construction of point mutants by recombinant PCR 
Construction of TthAKK(P195A) and TthAKK(P195K) 

To introduce mutations at amino acid position 195 (either a P195A or P195K) in 
10 the nuclease domain of TthAKK construct, mutagenic primer 785-073-01 (P195A) 5'- 
ccc-tcc-gac-aac-ctc-gcc-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 370) or 785-073-02 
5 (P195K) 5'-ccc-tcc-gac-aac-ctc-aag-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 371) and 

1 primer 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) were used in 
f: j a PCR reaction to generate a 787 base pair fragment. Another PCR fragment was 

15 obtained by using the primers: 390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID 
O NO:314) and 785-073-03 5'-gag-gtt-gtc-gga-ggg-gtc-3' (SEQ ID NO: 372) in a reaction 

1 1 with the same template. 

'=P The two PCR fragments overlap and were combined in a recombinant PCR 

2 reaction. The outside primers 390-076-08 and 209-074-02 were added, and the 

20 Advantage cDNA PCR kit (Clontech) was used according to manufacturer' s instructions. 
A 1380 base pair product was generated from this reaction. 

The recombinant PCR product was cut with the restriction enzymes EcoRI and 
NotI to yield a 986 base pair fragment. The TthAKK construct was prepared by cutting 
with the same enzymes. The fragment was then hgated into the vector, and fransformed 

25 into JM109 cells. New mutants developed from this set of reactions include: 

TthAKK(P195A) (DNA sequence SEQ ID NO: 373; amino acid sequence SEQ ID NO: 
374) and TthAKK(P195K) (DNA sequence SEQ ID NO: 375; amino acid sequence SEQ 
ID NO: 376). 

Construction of TthAKK(N417K/L418K) 

30 The same approach was used to construct TthAKK(N4 1 7K/L4 1 8K). Two 

overlapping PCR fi^igments were generated by mutagenic primers: 785-73-07 5'-gag- 
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agg-ctc-cat-cgg-aag-aag-ctt-aag-cgc-ctc-gag-3' (SEQ ID NO: 377) and 700-10-03 5'-ctt- 
ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers 158-084-01 5'-ctc-ctc-cac- 
gag-ttc-ggc-3' (SEQ ID NO:310) and 785-73-08 5'-ccg-atg-gag-cct-ctc-cga-3' (SEQ ID 
NO: 378). The two products were combined and amplified with outside primers 700-10- 
03 and 158-084-01 . The recombinant PGR product was cut with the restriction enzymes 
NotI and BamHI and ligated into the Notl/BamHI pre-cut TthAKK construct. This 
mutant was termed TthAKK(N417K/L418K) (DNA sequence SEQ ID NO: 379; amino 
acid sequence SEQ ID NO: 380). 

Construction of additional TthAKK point mutants by site-directed mutagenesis 
Construction of TthAKK(P255L) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-05 and 886-049-06: 5'-gtg-cgc-acc-gac-ctc-ctc-ctg-gag-gtg- 
gac-ctc-3' (SEQ ID NO: 381), 5'-gag-gtc-cac-ctc-cag-gag-gag-gtc-ggt-gcg-cac-3' (SEQ 
ID NO: 382) to generate TthAKK(P255L) (DNA sequence SEQ ID NO: 383; amino acid 
sequence SEQ ID NO: 384). 

Construction of TthAKK(F311Y) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-09 and 886-049-10: 5'-ggg-gcc-ttc-gtg-ggc-tac-gtc-ctc-tcc- 
cgc-ccc-3' (SEQ ID NO: P385), 5'-ggg-gcg-gga-gag-gac-gta-gcc-cac-gaa-ggc-ccc-3' 
(SEQ ID NO: 386) to generate TthAKK(F31 lY) (DNA sequence SEQ ID NO: 387; 
amino acid sequence SEQ ID NO: 388). 

Construction of TthAKK(N221H/R224Q) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-01 and 886-049-02: 5'-gaa-aac-ctc-ctc-aag-cac-ctg-gac-cag- 
gta-aag-cca-gaa-aac-3' (SEQ ID NO: 389), 5'-gtt-ttc-tgg-ctt-tac-ctg-gtc-cag-gtg-ctt-gag- 
gag-gtt-ttc-3' (SEQ ID NO: 390) to generate TthAKK(N221H/R224Q) (DNA sequence 
SEQ ED NO: 391; amino acid sequence SEQ ID NO: 392). 
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Construction of TthAKK(R251H) 

Site specific mutagenesis was performed on TthAKK construct using the 
mutagenic primer 886-049-03 and 886-049-04: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac-ctc- 
ccc-ctg-3' (SEQ ID NO: 393), 5'-cag-ggg-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' (SEQ 
5 ID NO: 394) to generate TthAKK(R25 IH) (DNA sequence SEQ ID NO: 395; amino acid 
sequence SEQ ID NO: 396). 

Construction of TthAKK(P255L/R251H) 

Site specific mutagenesis was performed on the TthAKK(P255L) construct using 
10 the mutagenic primer 886-088-01 and 886-088-02: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac- 
ctc-ctc-ctg-3' (SEQ ID NO: 397), 5'-cag-gag-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' 
(SEQ ID NO: 398) to generate TthAKK(P255L/R251H) (DNA sequence SEQ ID NO: 
399; amino acid sequence SEQ ID NO: 400). 

15 Construction of Tth AKK L429V (DNA sequence SEQ ID NO: 401 ; amino 

acid sequence SEQ ID NO: 402) 

Palm region randomization mutagenesis was performed on the Tth AKK construct 
using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
20 04 5 '-AAG GAG GTT CCG ATG GAG-3 ' (SEQ ID NO: 404) to introduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK E425V (DNA sequence SEQ ID NO: 405; amino 
acid sequence SEQ ID NO: 406) 

25 Palm region randomization mutagenesis was performed on the Tth AKK construct 

using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Pahn region. Brackets indicate a synthesis of 91% base shown and 3% 

30 all other bases. 
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Construction of Tth AKK L422N/E425K (DNA sequence SEQ ID NO: 407; 
amino acid sequence SEQ ID NO: 408) 

Palm region randomization mutagenesis was performed on the Tth AXK construct 
using the mutagenic primer 680-79-02 5'-ctc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK L422FAV430C (DNA sequence SEQ ID NO: 409; 
amino acid sequence SEQ ID NO: 410) 

Palm region randomization mutagenesis was performed on Tth AKK DNA using 
the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag gag 
aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79-04 
5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK A504F (DNA sequence SEQ ID NO: 411; amino 
acid sequence SEQ ID NO: 412) 

Site saturation mutagenesis was performed on Tth AKK. construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

Construction of Tth AKK A504V (DNA sequence SEQ ID NO: 415; amino 
acid sequence SEQ ID NO: 416) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 
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Construction of Tth AKK A504S (DNA sequence SEQ ID NO: 41 7; amino 
acid sequence SEQ ID NO: 418) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
5 mutagenic primer 680-80-03 5 '-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3 ' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

Construction of Tth AKK S517G (DNA sequence SEQ ID NO: 419; amino 
10 acid sequence SEQ ID NO: 420) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC NNN GCC GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 421) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3' (SEQ ID NO: 422) to introduce the random mutations in the 
15 S5 17 amino acid. 

Construction of Tth AKK A518L (DNA sequence SEQ ID NO: 423; amino 
acid sequence SEQ ID NO: 424) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
20 mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3' (SEQ ID NO:422) to introduce the random mutations in the 
A518 amino acid. 

25 Construction of Tth AKK A518R (DNA sequence SEQ ID NO: 426; amino 

acid sequence SEQ ID NO: 427) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-ggt gga 
30 gcg ctt gcc-3' (SEQ ID NO: 422) to introduce the random mutations in the A518 amino 
acid. 
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Construction of TaqSM L451R (DNA sequence SEQ ID NO: 428; amino acid 
sequence SEQ ID NO: 429) 

Site directed mutagenesis was performed on the Taq 5M construct using the 
5 mutagenic primer 240-60-05 5'-acg-ggg-gtg-cgc-cgg-gac-gtg-gcc-tat 3' (SEQ ID NO: 
430) to introduce the L45 IR mutation in the Taq 5M enzyme. 

Construction of Tth AKK A504K (DNA sequence SEQ ID NO: 431; amino 
acid sequence SEQ ID NO: 432) 
10 Site directed mutagenesis was performed on the Tth AKK construct using the 

mutagenic primer 680-69-04 5'-ctt agg ctt ccc aag ttg aag aag acg aag aag aca-3' (SEQ 
ID NO: 433) and reverse primer 680-69-05 5'-tgt ctt ctt cgt ctt ctt caa ctt ggg aag cct aag- 
3' (SEQ ID NO: 434) to introduce the A504K mutation in the Tth AKK enzyme. 

15 Construction of Tth AKK H641 A (DNA sequence SEQ ID NO: 435; amino 

acid sequence SEQ ID NO: 436) 

Site directed mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-69-08 5 '-gag ggg aag gac ate gcc acc cag acc gca agc-3' (SEQ ID 
NO: 437) and reverse primer 583-01-02 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc etc- 3' 
20 (SEQ ID NO: 438) to introduce the H641 A mutation in the Tth AKK enzyme. 

Construction of Tth AKK T508P (DNA sequence SEQ ID NO: 439; amino 
acid sequence SEQ ID NO: 440) 

Site directed mutagenesis was performed on TthAKK construct using the 
25 mutagenic primer 680-70-0 1 5 ' -ccc-gcc ttg aag aag ccg aag aag aca ggc aag-3 ' (SEQ ID 
NO: 441) and reverse primer 680-70-02 5'-ctt gcc tgt ctt ctt egg ctt ctt caa ggc ggg-3' 
(SEQ ED NO: 442) to introduce the T508P mutation in the Tth AKK enzyme. 

Chimeras and mutations in chimeras. 

30 

Fusion between TthAKK enzyme and alpha-peptide 
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A TthAKK-lacZ-alpha-peptide chimeric fiision was constructed to allow detection 
of mutations (including frame-shifts, deletions, insertions, etc.) which cause the inabihty 
of expression of the full-length fusion protein based on the colony blue-white screening 
(Wu et al., Nucleic Acids Research, 24:1710 [1996]). 

Site specific mutagenesis was performed on TthAKK DNA using the mutagenic 
primers 95 9-04 1 -03 , 5 '-cac-cac-cac-cac-cac-cac-gtc-gac-tag-tgc-tag-cgt-cga-cta-gct-gca- 
ggc-atg-caa-gct-tgg-c-3' (SEQ ID NO: 477) and 959-041-04, 5'-gcc-aag-ctt-gca-tgc-ctg- 
cag-cta-gtc-gac-gct-agc-act-agt-cga-cgt-ggt-ggt-ggt-ggt-ggt-g-3' (SEQ ID NO: 478) to 
generate pTthAKK-L with Sail site following the 6xHis tag at the C-terminus of TthAKK 
for the insertion of lacZ alpha peptide. The alpha peptide (of 201 amino acids) was first 
PGR amplified from the pCRH-TOPO vector (Invitrogen) with primers 959-041-01, 5'- 
cag-gaa-gcg-gcc-gcg-tcg-aca-tga-cca-tga-tta-cgc-caa-gc-3' (SEQ ID NO: 479) and 959- 
093-01, 5'-ggg-ccc-gcc-agg-gtc-gac-tca-ggg-cga-tgg-ccc-act-acg-tga-3' (SEQ ID NO: 
480). The PGR product was then digested with restriction enzyme Sail and ligated into 
the pTthAKK-L vector, which was also cut with the same enzyme to generate the 
chimeric construct TthAKK-alpha peptide (DNA sequence SEQ ID NO: 481; amino acid 
sequence SEQ ID NO: 482). The orientation of the insert was confirmed by sequencing. 

Construction of TthTscAKK and TthTfiAKK enzymes 

The TthAKK construct was cut with the enzymes EcoRI and NotI and the smaller 
insert fragment was gel isolated. The TscAKK or TfiAKK constructs were also cut with 
EcoRI and NotI and the larger fragment was gel isolated and purified. The Tth insert 
(nuclease domain) was Hgated into the TscAKK and TfiAKK vectors (polymerase 
domain) to generate TthTscAKK (DNA sequence SEQ ID NO: 447; amino acid sequence 
SEQ ID NO: 448) and TthTfiAKK (DNA sequence SEQ ID NO: 449; amino acid 
sequence SEQ ID NO: 450) chimeric constructs. 

FT mutations of Tth polymerase to improve substrate specificity 
Construction of Taq(FT)TthAKK 

Site specific mutagenesis was performed on the TaqTthAKK construct using the 
mutagenic primer 473-087-05: 5'-cgg-gac-ctc-gag-gcg-cgt-gaa-ccc-cag-gag-gtc-cac-3' 
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(SEQ ID NO: 483) to introduce the L107F/A108T mutations and generate 
Taq(FT)TthAKK (DNA sequence SEQ ID NO: 484; amino acid sequence SEQ ID NO: 
485). 

Construction of Tfi(FT)TthAKK 

5 Site specific mutagenesis was performed on the TfiTthAKK construct using the 

mutagenic primers 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' 
(SEQ ID NO: 486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc- 
cac-3' (SEQ ID NO: 487) to introduce the L107FA^108T mutations and generate 
Tfi(FT)TthAKK (DNA sequence SEQ ID NO: 349; amino acid sequence SEQ ID NO: 

10 488). 

Tfi(FT) DN 2M(N) and Tsc(FT) DN 2M(N) mutants 

The L107FA^108T mutations were introduced by isolating the NotI and Sail 
fragment of the Tfi DN 2M(N) mutant (DNA sequence SEQ ID NO: 345) and inserting it 

15 into a Notl-Sall pre-digested Tfi(FT)TthAKK (DNA sequence SEQ ID NO: 349; amino 
acid sequence SEQ ID NO: 488) to yield the Tfi(FT) DN 2M(N) mutant (DNA sequence 
SEQ ID NO: 350, amino acid sequence SEQ ID NO: 351). To add the L107F or the 
E108T mutations into this Tsc-based construct, an identical procedure was done. The 
NotI and Sail cut fragement of Tsc DN 2M(N) mutant (DNA sequence SEQ ID NO: 348) 

20 was inserted into Notl-SaU pre-digested Tsc(FT)TthAKK (DNA sequence SEQ ID NO: 
491) vector to yield Tsc(FT) DN 2M(N) mutant (DNA sequence SEQ ID NO: 352, amino 
acid sequence SEQ ID NO: 353). 

Construction of Tfi(FT) AKK and Tsc(FT) AKK 

25 Starting with the Tfi(FT)DN2M(N) construct described previously (DNA 

sequence SEQ ID NO: 350), primers (959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag- 
cgg-acg-aag-aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt- 
gag-gcc-3', 5'-ctc-ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3', 5'-cag-gac-cag-ctc-gtt-ggc- 
cac-ttg-gag-gag-gag-3' (SEQ ID NO: 354-357) were used to introduce the "AKK" set of 

30 mutations (H784A, G504K and E507K) by site specific mutagenesis. The resulting 
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mutant construct is termed Tfi(FT)AKK (DNA sequence SEQ ID NO:366, amino acid 
sequence SEQ ID NO:367). 

Likewise, primers 959-022-05 to -08: 5'-ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag- 
aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ctt-ctt-gat-ggc-ggg-aag-ccc-3', 5'- 
5 atg-ctt-ttg-cag-gtg-gcc-aac-gaa-ctg-gtc-ctc-3',5'-gag-gac-cag-ttc-gtt-ggc-cac-ctg-caa-aag- 
cat-3' (SEQ ID NO: 360-363) were used in a site specific mutagenesis reaction, with the 
Tsc(FT)DN2M(N) mutant (DNA sequence SEQ ID NO: 352) as template to generate the 
Tsc(FT)AKK mutant (DNA sequence SEQ ID NO: 368; amino acid sequence SEQ ID 
NO: 369). 

10 Construction of Tsc(FT)TthAKK 

Site specific mutagenesis was performed on the TscTthAKK construct using the 
mutagenic primers 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) 
and reverse primer 680-21-03 5'-cgg cac etc gag gcg ggt aaa gcc caa aag gtc cac-3' (SEQ 
ID NO: 490) to introduce the L107F/E108T mutations and generate Tsc(FT)TthAKK 

15 (DNA sequence SEQ ID NO: 454; amino acid sequence SEQ ID NO: 491). 
Construction of Taq(FT)TscAKK and Taq(FT)TfiAKK enzymes 

The Taq(FT)TthAKK construct was cut with the enzymes EcoRI and NotI and the 
smaller insert fi:agment was gel isolated. The TscAKK or TfiAKK constructs were also 
cut with EcoRI and NotI and the larger fragment was gel isolated and purified. The 

20 Taq(FT) insert (nuclease domain) was Ugated into the TscAKK and TfiAKK vectors 
(polymerase domain) to generate the Taq(FT)TscAKK (DNA sequence SEQ ID NO: 
443; amino acid sequence SEQ ID NO: 444) and Taq(FT)TfiAKK (DNA sequence SEQ 
ID NO: 445; amino acid sequence SEQ ID NO: 446) chimeric constructs. 
Construction of TaqEFT-Tth(AKK) (DNA sequence SEQ ID NO:501; amino 

25 acidsequence SEQ ID NO: 502) 

Site specific mutagenesis was performed on Taq(FT)-Tth(AKK) DNA (SEQ ID 
NO: 484) using the mutagenic primers 436-013-08: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
ccc-tcc-ttc-c-3' (SEQ ID NO:503) and 436-013-09 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa- 
aga-cca-cga-t-3' (SEQ ID NO:504) to introduce the K70E mutation. 

30 

Additional mutations to improve enzyme activity of Taq(EFT)TthAKK 
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Construction of TaqEFT-Tth(AKK)-A/Ml (DNA sequence SEQ ID NO:505; amino 
acid sequence SEQ ID NO: 506) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
5 NO: 501) using the mutagenic primers 1044-038-01 : 5'-cag-acc-atg-aat-tcg-gag-gcg-atg- 
ctg-ccc-ctc-ttt-3' (SEQ ID NO:507) and 1044-038-02: (SEQ ID NO: 508) 
5'-aaa-gag-ggg-cag-cat-cgc-ctc-cga-att-cat-ggt-ctg-3' to introduce the G4EA mutation. 

Construction of TaqEFT-Tth(AKK)-B/M2 (DNA sequence SEQ ID NO:509; 
10 amino acid sequence SEQ ID NO: 510) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-03: 5'-gcc-tac-cgc-acc-ttc-ttt-gcc-ctg- 
aag-ggc-ctc-3 and 1044-038-04: 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
ID NO: 51 1 and 512, respectively) to introduce the H29F mutation. 

15 

Construction of TaqEFT-Tth(AKK)-C/M3 (DNA sequence SEQ ID NO:513; 

amino acid sequence SEQ ID NO: 514) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

NO: 501) using the mutagenic primers 1044-038-05: 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac- 
20 ggg-gac-gcg-3' and 1044-038-06: 5'-cgc-gtc-ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ 

ID NO: 515 and 516, respectively) to introduce the K57R mutation. 

Construction of TaqEFT-Tth(AKK)-D/M5 (DNA sequence SEQ ID NO:517; 

amino acid sequence SEQ ID NO:518) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK:) DNA (SEQ ID 
25 NO: 501) using the mutagenic primers 1044-038-09: 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc- 

aag-aag-gcg-3' and 1044-038-10: 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc-gtc-3' 

(SEQ ID NO: 519 and 520, respectively) to introduce the S125T mutation. 

Construction of TaqEFT-Tth(AKK)-E/M6 (DNA sequence SEQ ID NO:521; 

amino acid sequence SEQ ID NO: 522) 
30 Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

NO: 501) using the mutagenic primers 1044-038-11: 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt- 
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ctg-gag-gag-3' and 1044-038-12: 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' (SEQ 
ID NO: 523 and 524, respectively) to introduce the R206L mutation. 

Construction of TaqEFT-Tth(AKK)-F/M7 (DNA sequence SEQ ID NO:525; amino 
acid sequence SEQ ID NO: 526) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-13: 5'-gag-ccc-gac-cgg-gag-ggg-ctt- 
aag-gcc-ttt-ctg-gag-agg-3' and 1044-038-14: 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg- 
gtc-ggg-ctc-3 (SEQ ID NO: 527 and 528, respectively) to introduce the R269G and 
R27 IK mutations. 

Construction of TaqEFT-Tth(AKK)-G/M8 (DNA sequence SEQ ID NO:529; amino 
acid sequence SEQ ID NO: 530) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-15: 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg- 
gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg-ccc-3' and 1044-038-16: 5'-ggg-cca-ggg-ggc-ctc- 
ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 531 and 532, 
respectively) to introduce the E290G, S291G, P292E, A294P, and L295R mutations. 

Construction of TaqEFT-Tth(AKK)-H/M9 (DNA sequence SEQ ID NO:533; amino 
acid sequence SEQ ID NO: 534) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-17: 5'-ctg-gcc-ctg-gcc-gcc- tgc-agg- 
ggc-ggc-cgc-gtg-3' and 1044-038-18: 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc-cag-3' 
(SEQ ID NO: 535 and 536, respectively) to introduce the A328C mutation. 

Construction of TaqEFT-Tth(AKK)-I/M10 (DNA sequence SEQ ID NO:537; amino 

acid sequence SEQ ID NO: 538) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1080-015-01 (SEQ ID NO:539) 5'-ggg gag aag 
acg gcg agg aag ctt ctg aag gag tgg ggg agc-3' and 1080-015-02 (SEQ ID NO: 540) 5'- 
gct ccc cca etc ctt cag aag ctt cct cgc cgt ctt etc ccc-3' to introduce the E210K mutation. 
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Construction of TaqEFT-Tth(AKK)-Ml-9 (DNA sequence SEQ ID NO:541; amino 
acid sequence SEQ ID NO: 542) 

Seven independent PCR reactions were performed, using construct TaqEFT- 

5 Tth(AKK) (SEQ ID: 501) as a template, with the following pairs of mutagenic primers: 
PCR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 
ID NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
ID NO:512) to yield a 108 base pair fragment, PCR Reaction 2; 1044-038-03 5'-gcc-tac- 
cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO: 511) and 1044-038-06 5'-cgc-gtc- 

10 ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 5 1 6) to yield a 1 1 7 base pair 
fragment, PCR Reaction 3; 1044-038-05 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg-gac- 
gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc- 
gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fragment, PCR Reaction 4; 1044-038- 
09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1044-038- 

15 12 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' (SEQ ID NO: 524) to yield a 276 base 
pair fragment, PCR Reaction 5; 1044-038-11 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt-ctg- 
gag-gag-3' (SEQ ID NO: 523) and 1044-038-14 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc- 
ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base pair fragment, PCR Reaction 6; 
1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt-ctg-gag-agg-3' (SEQ ID NO: 

20 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc- 
gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 1 13 base pair fragment, PCR Reaction 7; 
1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg- 
ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc- 
cag-3' (SEQ ID NO: 536) to yield a 157 base pair fragment. The seven PCR products 

25 overlap such that PCR ampUfication in the absence of primers yielded the appropriate 
1005 base pair product to introduce the K70E, G4EA, H29F, K57R S125T, R206L, 
R269G, R271K, E290G, S291G, P292E, A294P, L295R, and A328C mutations. The 
product was ftirther amplified using the outside primers: 1044-038-1 (SEQ ED NO: 507) 
and 1044-038-18 (SEQ ID NO: 536) and cloned into pPCR-Script-Amp. From this 

30 construct, the nuclease domain was again amplified using primers: 1 044-038- 1 (SEQ ID 
NO: 507) and 1044-038-18 (SEQ ID NO: 536) and digested with EcoRI and Notl. The 
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digested PGR product was ligated into an EcoRI and NotI digested TaqEFT-Tth(AKK) 
construct and transformed into JM109 for protein expression and screening. 

Construction of TaqEFT-Tth(AKK)-Ml-10 (DNA sequence SEQ ID NO:543; amino 
5 acid sequence SEQ ID NO: 544) 

Seven independent PGR reactions were performed, using construct TaqEFT- 
Tth(AKK) (SEQ ID: 501) as a template, with the following pairs of mutagenic primers: 
PGR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 
ID NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 

10 ID NO:512) to yield a 108 base pair fragment, PGR Reaction 2; 1044-038-03 5'-gcc-tac- 
cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3" (SEQ ID NO: 511) and 1044-038-06 5'-cgc-gtc- 
ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 117 base pair 
fragment, PGR Reaction 3; 1044-038-05 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg-gac- 
gcg-3' (SEQ ED NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc- 

15 gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fragment, PGR Reaction 4; 1044-038- 
09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1080-42-02 
5'-gc ttc cag get ccc cca etc ctt cag aag ctt gag cgc cgt ctt etc ccc-3' (SEQ ID NO: 546) to 
yield a 299 base pair fragment, PGR Reaction 5; 1080-42-01 5'-ggg gag aag acg gcg etc 
agg ctt ctg aag gag tgg ggg age ctg gaa gc-3' (SEQ ID NO: 545) and 1044-038-14 5'-cct- 

20 ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base 
pair fragment, PGR Reaction 6; 1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt- 
ctg-gag-agg-3' (SEQ ID NO: 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg- 
ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 1 13 base pair 
fragment, PGR Reaction 7; 1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc- 

25 cgg-gag-gag-gcc-ccc-tgg-ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc- 
cct-gca-ggc-ggc-cag-ggc-cag-3' (SEQ ID NO: 536) to yield a 157 base pair fragment. 
The seven PGR products overlap such that PGR ampUfication in the absence of primers 
yielded the appropriate 1005 base pair product to introduce the K70E, G4EA, H29F, 
K57R, S125T, R206L, E210K, R269G, R271K, E290G, S291G, P292E, A294P, L295R, 

30 and A328G mutations. The product was further ampUfied using the outside primers: 
1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and cloned into 
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pPCR-Script-Amp. From this construct, the nuclease domain was again ampUfied using 
primers: 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and 
digested with EcoRI and Notl. The digested PGR product was ligated into an EcoRI and 
NotI digested TaqEFT-Tth(AKK) construct and transformed into JM109 for enzyme 
expression and screening. 

K69E Mutation of FEN enzymes to further improve substrate specificity 

Construction of Tth(K69E)AKK, Taq(K69E)TthAKK, Tfi(K69E)TthAKK and 
Tsc(K69E)TthAKK and mutants 

Site specific mutagenesis was performed on TthAKK, TaqTthAKiC, TfiTthAKK 
and TscTthAKK DNAs using the mutagenic primers 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga- 
t-3' (SEQ ID NO: 493) to introduce the K69E mutation and to generate Tth(K69E)AKK 
(DNA sequence SEQ ID NO: 452; amino acid sequence SEQ ID NO: 494), 
Taq(K69E)ThAKK, (DNA sequence SEQ ID NO: 495; amino acid sequence SEQ ID 
NO: 496), Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497; amino acid sequence 
SEQ ID NO: 498)and Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499; amino acid 
sequence SEQ ID NO: 500) mutant enzymes. 

Construction of Tsc(167-334)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-01: 5'-ctt-ctc-tca-tcc-gcc-aaa-aca- 
gcc-3' (SEQ ID NO: 451) with template Tth(K69E)AKK, (DNA sequence SEQ ID NO: 
452), and primers 1044-041-02: 5'-ctc-ctc-cac-gag-ttc-ggc-3' (SEQ ID NO: 453) and 
209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with template 
Tsc(K69E)TthAKK, (DNA sequence SEQ ID NO: 454). Two products were combined 
and amplified with outside primers 390-76-08 and 209-074-02. The recombinant PGR 
product was cut with the restriction enzymes EcoRI and Notl and Ugated into the vector 
TthAKK which was prepared by cutting with the same enzymes to yield Tsc(167- 
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333)TthAKK construct (DNA sequence SEQ ID NO: 455; amino acid sequence SEQ ID 
NO: 456). 

Construction of Tsc(222-334)TthAKK 

5 Two overlapping PGR fragments were generated by primers 390-76-08: 5'-tgt- 

gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-03: 5'-ttc-cag-gtg-ctt-gag-gag-gtt- 
ttc-cag-3' (SEQ ID NO: 457) with template Tth(K69E)AKK (DNA sequence SEQ ID 
NO: 452), and primers 1044-041-04: 5'-ctc-ctc-aag-cac-ctg-gaa-cag-gtg-aaa-3' (SEQ ID 
NO: 458) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 

10 template Tsc(K69E)TthAKK, (DNA sequence SEQ ID NO: 454). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
Ugated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(222-334)TthAKK construct (DNA sequence SEQ ID NO: 459; amino acid 

15 sequence SEQ ID NO: 460). 

Construction of Tfi(222-334)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-05: 5'-gtc-cag-gtt-ctt-gag-gag-gtt- 

20 ttc-cag-3' (SEQ ID NO: 46 1 ) with template Tth(K69E)AKK (DNA sequence SEQ ID 
NO: 452), and primers 1044-058-06: 5'-ctc-ctc-aag-aac-ctg-gac-cgg-gta-aag-3' (SEQ ID 
NO: 462) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 
template Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 

25 recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 

ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tfi(222-334)TthAKK construct (DNA sequence SEQ ID NO: 463; amino acid 
sequence SEQ ID NO: 464). 
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Construction of Tfi(167-334)TthAKK 

Two overlapping PCR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQIDNO:314) and 1044-058-07 5'-gac-gtc-ctt-cgg-ggt-gat- 
gag-gtg-gcc-3' (SEQ ID NO: 465) with template Tth(K69E)AKK, (DNA sequence SEQ 

5 ID NO: 452), and primers 1044-058-08 5'-atc-acc-ccg-aag-gac-gtc-cag-gag-aag-3' (SEQ 
ID NO: 466) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 
with template Tfi(K69E)TthAKK, (DNA sequence SEQ ID NO: 498). Two products 
were combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PCR product was cut with the restriction enzymes EcoRI and NotI and 

10 ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tfi(167-333)TthAKK construct (DNA sequence SEQ ID NO: 467; amino acid 
sequence SEQ ID NO: 468). 

Construction of Tsc(lll-334)TthAKK 

15 Two overlapping PCR fragments were generated by primers 390-76-08 5'-tgt- 

gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-01 5'-ctc-gag-gcg-ggt-aaa-ccc- 
cag-gag-gtc-3' (SEQ ID NO: 469) with template Tth(K69E)AKK (DNA sequence SEQ 
ID NO: 452), and primers 1044-058-02 5'-ggg-ttt-acc-cgc-ctc-gag-gtg-ccc-ggc-3' (SEQ 
ID NO: 470) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 

20 with template Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499). Two products 
were combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PCR product was cut with the restriction enzymes EcoRI and NotI and 
hgated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(167-333)TthAKK construct (DNA sequence SEQ ID NO: 471; amino acid 

25 sequence SEQ ID NO: 472). 

Construction of Tsc(l-167)TthAKK 

Two overlapping PCR Augments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-03 5'-aag-cca-ctc-cgg-ggt-gat- 
30 cag-gta-acc-3' (SEQ ID NO: 473) with template Tsc(K69E)TthAKK (DNA sequence 
SEQ ID NO: 499), and primers 1044-058-04 5'-atc-acc-ccg-gag-tgg-ctt-tgg-gag-aag-3' 
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(SEQ ID NO: 474) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
N0:3 16) with template Tth(K69E)AKK (DNA sequence SEQ ID NO: 452). Two 
products were combined and amplified with outside primers 390-76-08 and 209-074-02. 
The recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(l-167)TthAKK construct (DNA sequence SEQ ID NO: 475; amino acid 
sequence SEQ ID NO: 476). 

Modification of AfuFEN enzymes 

Construction of pAfuFEN 

Plasmid pAfiiFENl was prepared as described [U.S. Patent application Ser. No. 
09/684,938, WO 98/23774, each incorporated herein by reference in their entireties]. 
Briefly, genomic DNA was prepared from one vial (approximately 5 ml of culture) of 
live A. fulgidus bacteria from DSMZ (DSMZ #4304) with the DNA XTRAX kit (Gull 
Laboratories, Salt Lake City, UT) according to the manufacturer's protocol. The final 
DNA pellet was resuspended in 100 \i\ of TE (10 mM Tris HCl, pH 8.0, 1 mM EDTA). 
One microliter of the DNA solution was employed in a PGR using the ADVANTAGE 
cDNA PGR kit (Glonetech); the PGR was conducted according to manufacturer's 
recommendations. 

The 5' end primer is complementary to the 5' end of the Afu FEN-1 gene except it 
has a 1 base pair substitution to create an Nco I site. The 3' end primer is complemetary 
to the 3' end of the Afu FEN-1 gene downstream from the FEN-1 ORF except it contains 
a 2 base substitution to create a Sal I site. The sequences of the 5' and 3' end primers are 
5'-GGGTGAAGATTTAGGATGGGTGGGGA-3' (SEQ ID NO:617) and 
5'-GGGGGAGGTCGTAGTGGAGATGGTTTTGGTG (SEQ ID NO:618), respectively. 
Cloning of the resulting fragment was as described for the PfuFENl gene, U.S. Patent 
appUcation Ser. No. 5,994,069, incorporated herein in its entirety for all purposes, to 
create the plasmid pTrc99-AFFENl . For expression, the pTrcAftiHis plasmid was 
constructed by modifying pTrc99-AFFENl, by adding a histidme tail to facilitate 
purification. To add this histidine tail, standard PGR primer-directed mutagenesis 
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methods were used to insert the coding sequence for six histidine residues between the 
last amino acid codon of the pTrc99-AFFENl coding region and the stop codon. The 
resulting plasmid was termed pTrcAfuHis. The protein was then expressed as described 
and purified by binding to a Ni++ affinity column. 

5 

Construction of Afu(Y236A), A46-5 

Two overlapping PGR fi-agments were generated fi-om template AfiiFEN (SEQ ID 
]SIO:556) with primers: 785-73-04 5'-ctg-gtc-ggg-acg-gac-gcc-aat-gag-ggt-gtg-aag-3' 
(SEQ ID NO: 547) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID 

10 NO:343), and primers: 390-076-08 5 '-tgt-gga-att-gtg-agc-gg-3' (SEQ ID N0:3 14) and 
785-73-06 5'-gtc-cgt-ccc-gac-cag-aat-3' (SEQ ID NO: 548). The two products were 
combined and amplified with outside primers 700-10-03 and 390-076-08. The 
recombinant PGR product was cut with the restriction enzymes Ncol and Sail and ligated 
into the AfiiFEN construct which was prepared by cutting with the same enzymes to yield 

15 the Afu(Y236A) construct (DNA sequence SEQ ID NO: 549; amino acid sequence SEQ 
ID NO: 550). 

Construction of Afu(Y236R), A56-9 

Two overlapping PGR fi-agments were generated using the template AfiiFEN with 
20 primers: 785-73-05 5 '-ctg-gtc-ggg-acg-gac-agg-aat-gag-ggt-gtg-aag-3' (SEQ ID NO: 

551) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers: 
390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 785-73-06 5'-gtc-cgt-ccc- 
gac-cag-aat-3' (SEQ ID NO: 548). The two products were combined and ampHfied with 
outside primers 700-10-03 and 390-076-08. The recombinant PGR product was cut with 
25 the restriction enzymes Ncol and Sail and ligated into the AfiiFEN construct which was 
prepared by cutting with the same enzymes to yield Afu(Y236R) construct (DNA 
sequence SEQ ID NO: 552; amino acid sequence SEQ ED NO: 553). 
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2. Chimeras of FEN enzymes and Thermus Polymerase derivatives 

The following enzyme constructs combine portions of the AfiiFEN enzyme 
polymerase domain and the polymerase domain of Thermus ploymerases. These 
combinations were designed based on information generated by molecular modeling. 

Construction of Afu336-Tth296(AKK), ATl-1 

Two overlapping PGR fragments were generated. The first fragment was made 
using the AfiiFEN construct (SEQ ID NO:556) as template with the primers: 390-076-08 
5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 390-065-05 5'-gaa-cca-cct-ctc-aag- 
cgt-gg-3' (SEQ ID NO: 554). The second fragment was made using TthAKK (SEQ ID 
NO:558) as template and the primers: 700-049-01 5'-acg-ctt-gag-agg-tgg-ttc-ctg-gag- 
gag-gcc-ccc-tgg-3' (SEQ ID NO: 555) and 390-076-09 5'-taa-tct-gta-tca-ggc-tg-3' (SEQ 
ID NO:557). The two products contain a region of sequence overlap, and were combined 
and ampUfied with outside primers 390-076-08 and 390-076-09 in a recombinant PGR 
reaction. The recombinant PGR product was cut with the restriction enzymes BspEI and 
Sail and ligated into the AfuFEN construct which was prepared by cutting with the same 
enzymes to yield Afu336-Tth296(AKK) construct (DNA sequence SEQ ID NO: 559; 
amino acid sequence SEQ ID NO: 560). 
Construction of Afu328-Tth296(AKK), AT2-3 

Two overlapping PGR fragments were generated, the first by primers: 390-076-08 
5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 700-049-02 5'-ggt-tga-ctt-cag-agc-ttt- 
gag-3' (SEQ ID NO: 561) with template AfuFEN (DNA sequence SEQ ID NO:556), and 
the second by primers: 700-049-03 5'-aaa-gct-ctg-aag-tca-acc-ctg-gag-gag-gcc-ccc-tgg-3' 
(SEQ ID NO: 562) and 390-076-09 5'-taa-tct-gta-tca-ggc-tg-3' (SEQ ID NO:557) with 
template TthAKK (DNA sequence SEQ ID NO:558). The two products were combined 
and ampUfied with outside primers 390-076-08 and 390-076-09. The recombinant PGR 
product was cut with the restriction enzymes BspEI and Sail and Ugated into the vector 
pAfuFEN which was prepared by cutting with the same enzymes to yield Afu336- 
Tth296(AKK) construct (DNA sequence SEQ ID NO: 563; amino acid sequence SEQ ID 
NO: 564). 
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Construction of Afu336-Taq5M 

The TaqSM construct (SEQ ID N0:41) was cut with the enzymes NotI and Sail 
and the smaller insert fragment was gel isolated. The Afu336-Tth296(AKK) construct 
(DNA sequence SEQ ID NO: 559) was also cut with the same restriction enzymes and 
5 the larger vector jfragment was purified. The insert (Taq5M polymerase domain) was 
then hgated into the vector to generate the Afu336-Taq5M construct (DNA sequence 
SEQ ID NO: 565; amino acid sequence SEQ ID NO: 566). 

Construction of Afu336-TaqDN 

10 The TaqDNHT construct was cut with the enzymes NotI and Sail and the smaller 

% insert fragment was gel isolated. The Afu336-Tth296(AKK) construct (DNA sequence 

ro SEQ ID NO: 559) was also cut with the same restriction enzymes and the larger vector 

i;; fragment was purified. The insert (TaqDN polymerase domain) was then hgated into the 

5 vector to generate the Afu336-Taq5M construct (DNA sequence SEQ ID NO: 567; amino 

En 15 acid sequence SEQ ID NO: 568). 

^ : Random Chimerization of Thermus Polymerases 

Numerous enzymes with altered fimctions were generated via random 
ll chimerization of Thermus polymerases based on the principal of DNA shuffling (Volkov 

20 and Arnold, Methods in Enzymology 328:456 [2000]). The procedure below was used to 
develop the following chimeras. 

The genes of interest for random chimerization were PGR amplified with primers: 
390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 209-074-02: 5'-gtg-gcc- 
tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) to generate the approximately 1.4 kbp 
25 templates. About 2 jig of the DNA templates were mixed in equal proportion, and then 
digested with DNase I (0,33 U) in a 30 jil reaction at 15° C for approximately 1 minute to 
generate fragments 50-200 bp in size. DNA fragments were purified in a 4 % agarose gel 
and extracted by QIAEXII gel extraction kit (QIAGEN) according to manufacturer's 
instructions. The purified fragments (10 \\\) were added to 10 |il of 2X PGR pre-mix (5- 
30 fold diluted, cloned Pfii buffer, 0.4 mM each dNTP, 0.06 U/jiil cloned Pfii DNA 

polymerase, STRATAGENE Cat.# 600153, with accompanying buffer) for the fragment 
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reassembly reaction (PGR program: 3 min 94 "C followed by 40 cycles of 30 sec 94 *'C, 1 
min 52 °C, 2 min + 5 s/cycle 72 °C, followed by 4 min at 72 ''C). The reassembled 
products (1 III of a 10-fold dilution) were then PGR ampHfied with apair of nested 
primers: 072-090-01 5'-gag-cgg-ata-aca-att-tca-cac-agg-3' (SEQIDNO: 569) and 189- 

5 082-01 5'-tgc-ccg-gtg-cac-gcg-gcc-gcc-cct-gca-ggc-3' (SEQ ID NO: 570) using the 
GLONTEGH GG melt cDNA PGR kit (Cat.# K1907-Y) according to the manufacturer's 
instructions. The purified PGR products were digested with restriction enzymes EcoRI 
and NotI and then Ugated into the TthAKK construct that was prepared by cutting with 
the same enzymes. The Ugation mixture was transformed into JM109 competent cells 

10 and colonies were screened for enzyme activity. 

1. Generation of Random Chimeras S26 and S36 

The templates used to develop these chimeric enzymes were the nuclease domains 
15 from TthAKK, TaqTthAKK, TscTthAKK and the TfiTthAKK constructs. Two clones 
were found to show improvement of activity based on primary activity screening. 
Random chimeras S26 (DNA sequence SEQ ID NO: 571; amino acid sequence SEQ ID 
NO: 572) and S36 (DNA sequence SEQ ID NO: 573; amino acid sequence SEQ ID NO: 
574) were then sequenced and isolated. 

20 

2. Introduction of L107F/E108T, L109FA^110T and K69E mutations in S26 
and 836 to improve substrate specificity 

Construction of S26(FT) 

Site specific mutagenesis was performed on pS26 DNA using the mutagenic 
25 primers: 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) and 680- 
21-03 5'-cgg-cac-ctc-gag-gcg-ggt-aaa-gcc-caa-aag-gtc-cac-3' (SEQ ID NO: 490) to 
introduce the L107F/E108T mutations and generate S26(FT) (DNA sequence SEQ ID 
NO: 575; amino acid sequence SEQ ID NO: 576). 
Construction of S36(FT) 
30 Site specific mutagenesis was performed on pS36 DNA using the mutagenic 

primers: 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' (SEQ ID NO: 
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486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc-cac-3' (SEQED 
NO: 487) to introduce the L109FA^1 lOT mutations and generate S36(FT) (DNA 
sequence SEQ ID NO: 577; amino acid sequence SEQ ID NO: 578). 

Construction of S26(K69E) 

5 Site specific mutagenesis was performed on S26 DNA using the mutagenic 

primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 
gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the 
K69E mutation and to generate S26(K69E) (DNA sequence SEQ ID NO: 579; amino 
acid sequence SEQ ID NO: 580). 

10 Construction of S26(FT/K69E) 

Site specific mutagenesis was performed on S26(FT) DNAs using the mutagenic 

primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 
gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the 
K69E mutation and to generate S26(FT/K69E) (DNA sequence SEQ ID NO: 581; amino 
15 acid sequence SEQ ID NO: 582). 

3. More random chimeras, N3D7, Nl A12 N1C4, and N2C3 

The templates used to generate these chimerics were the nuclease domains of 
Tth(K69E)AKK, Taq(K69E)TthAKK, Tsc(K69E)TthAKK and Tfi(K69E)TthAKK 

20 constructs. Four clones were shown to have improved activity based on the primary 
activity screening. Random chimeras N3D7 (DNA sequence SEQ ID NO: 583; amino 
acid sequence SEQ ID NO: 584), Nl A12 (DNA sequence SEQ ED NO: 585; amino acid 
sequence SEQ ID NO: 586), N1C4 (DNA sequence SEQ ID NO: 587; amino acid 
sequence SEQ ID NO: 588) and N2C3 (DNA sequence SEQ ID NO: 589; amino acid 

25 sequence SEQ ID NO:590) were then sequenced and isolated respectively. 

EXAMPLE 10 
Test For The Dependence Of An Enzyme On 
The Presence Of An Upstream Oligonucleotide 

30 When choosing a structure-specific nuclease for use in a sequential invasive 

cleavage reaction it is preferable that the enzyme have little ability to cleave a probe 1) in 
the absence of an upstream oligonucleotide, and 2) in the absence of overlap between the 



241 



upstream oligonucleotide and the downstream probe oligonucleotide. Figs. 26a-e depicts 
several structures that can be used to examine the activity of an enzyme is confronted 
with each of these types of structures. The structure a (Fig. 26a) shows the alignment of 
a probe oUgonucleotide with a target site on bacteriophage M13 DNA (M13 sequences 
5 shown in Fig. 26 are provided in SEQ ID NO:217) in the absence of an upstream 

oUgonucleotide. Structure b (Fig. 26b) is provided with an upstream oligonucleotide that 
does not contain a region of overlap with the labeled probe (the label is indicated by the 
star). In structures c, d and e (Figs. 26c-e) the upstream oUgonucleotides have overlaps 
of 1, 3 or 5 nucleotides, respectively, with the downstream probe oligonucleotide and 
10 each of these structures represents a suitable invasive cleavage structure. The enzyme 
Pfu FEN-1 was tested for activity on each of these structures and all reactions were 
performed in duplicate. 

Each reaction comprised 1 \iM 5' TET labeled probe oUgonucleotide 89-15-1 
(SEQ ID NO:212), 50 nM upstream oligonucleotide (either oUgo 81-69-2 [SEQ ID 
15 NO:216], oUgo 81-69-3 [SEQ ID NO:215], oUgo 81-69-4 [SEQ ID NO:214], oligo 

81-69-5 [SEQ ID NO:213], or no upstream oligonucleotide), 1 finol M13 target DNA, 10 
mg/ml tRNA and 10 ng of Pfu FEN-1 in 10 ^1 of 10 mM MOPS (pH 7.5), 7.5 mM MgCb 
with 0.05% each of Tween 20 and Nonidet P-40. 

All of the components except the enzyme and the MgCb were assembled in a 
20 final volume of 8 nl and were overlaid with 10 ^1 of CHILLOUT liquid wax. The 

samples were heated to the reaction temperature of 69°C. The reactions were started by 
the addition of the PJu FEN-1 and MgCb, in a 2 (il volume. After incubation at 69°C for 
30 minutes, the reactions were stopped with 10 |il of 95% formamide, 10 mM EDTA, 
0.02% methyl violet. Samples were heated to 90°C for 1 min immediately before 
25 electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 
urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. Gels were then analyzed 
with a FMBIO-100 Hitachi FMBIO fluorescence imager. The resulting image is 
displayed in Fig. 27. 

In Fig. 27, lanes labeled "a" contain the products generated from reactions 
30 conducted without an upstream oligonucleotide (structure a), lanes labeled "b" contain an 
upstream oUgonucleotide that does not invade the probe/target duplex (structure b). 
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Lanes labeled "c", "d" and "e" contain the products generated from reactions conducted 
using an upstream oligonucleotide that invades the probe/target duplex by 1, 3 or 5 bases, 
respectively. The size (in nucleotides) of the uncleaved probe and the cleavage products 
is indicated to the left of the image in Fig, 27, Cleavage of the probe was not detectable 

5 when structures a and b were utilized. In contrast, cleavage products were generated 
when invasive cleavage structures were utiUzed (structures c-e). These data show that the 
Pfu FEN-1 enzyme requires an overlapping upstream oligonucleotide for specific 
cleavage of the probe. 

Any enzyme may be examined for its suitability for use in a sequential invasive 

10 cleavage reaction by examining the abiUty of the test enzyme to cleave structures a-e (it 
is understood by those in the art that the specific ohgonucleotide sequences shown in 
Figs. 26a-e need not be employed in the test reactions; these structures are merely 
illustrative of suitable test structures). Desirable enzymes display little or no cleavage of 
structures a and b and display specific cleavage of structures c-e {Le,, they generate 

15 cleavage products of the size expected from the degree of overlap between the two 
oligonucleotides employed to form the invasive cleavage structure), 

EXAMPLE 11 

Use Of The Products Of A First Invasive Cleavage Reaction To Enable A Second 
20 Invasive Cleavage Reaction With A Net Gain In Sensitivity 

As discussed in the Description of The hivention above, the detection sensitivity 
of the invasive cleavage reaction can be increased by the performing a second round of 
invasive cleavage using the products of the first reaction to complete the cleavage 
structure in the second reaction. In this Example, the use of a probe that, when cleaved in 
25 a first invasive cleavage reaction, forms an integrated INVADER ohgonucleotide and 
target molecule for use in a second invasive cleavage reaction, is illustrated. 

A first probe was designed to contain some internal complementarity so that when 
cleaved in a first invasive cleavage reaction the product C*Cut Probe 1") could form a 
target strand comprising an integral INVADER ohgonucleotide, A second probe was 
30 provided in the reaction that would be cleaved at the intended site when hybridized to the 
newly formed target/INVADER molecule. To demonstrate the gain in signal due to the 
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performance of sequential invasive cleavages, a standard invasive cleavage assay, as 
described above, was performed in parallel. 

All reactions were performed in duplicate. Each standard (Le., non-sequential) 
invasive cleavage reaction comprised 1 |i,M 5' fluorescein-labeled probe oligo 073-182 (5' 

5 Fl-AGAAAGGAAGGGAAGAAAGCGAA-3'; SEQ ID NO:591), 10 nM upstream oligo 
81-69-4 (5*-CTTGACGGGGAAAGCCGGCGAACGTGGCGA-3'; SEQ ID NO:214), 10 
to 100 attomoles of M13 target DNA, 10 mg/ml tRNA and 10 ng of Pfu FEN-1 in 10 (il 
of 10 mM MOPS (pH 7.5), 8 mM MgCl2 with 0.05% each of Tween 20 and Nonidet 
P-40, All of the components except the enzyme and the MgCb were assembled in a 

10 volume of 7 \xl and were overlaid with 10 )al of CHILLOUT liquid wax. The samples 
were heated to the reaction temperature of 62''C. The reactions were started by the 
addition of the Pfu FEN-1 and MgCb, in a 2 ^il volume. After incubation at 62^C for 30 
minutes, the reactions were stopped with 10 [il of 95% formamide, 10 mM EDTA, 0.02%) 
methyl violet. 

15 Each sequential invasive cleavage reaction comprised 1 |aM 5' fluorescein-labeled 

oligonucleotide 073-191 (the first probe or "Probe 1", 5^ Fl-TGGAGGTCAAAACATCG 
ATAAGTCGAAGAAAGGAAGGGAAGAAAT-3'; SEQ ID NO:592), 10 nM upstream 
oligonucleotide 81-69-4 (5'- CTTGACGGGGAAA GCCGGCGAACGTGGCGA-3'; 
SEQ ED NO:214), 1 [iM of 5' fluorescein labeled oligonucleotide 106-32 (the second 

20 probe or "Probe 2", 5^ Fl-TGTTTTGACCT CCA-3^; SEQ ID NO:593), 1 to 100 amol of 
M13 target DNA, 10 mg/ml tRNA and 10 ng of Pfu FEN-1 in 10 yd of 10 mM MOPS 
(pH 7.5), 8 mM MgCla with 0.05% each of Tween 20 and Nonidet P-40. All of the 
components except the enzyme and the MgCh were assembled in a volume of 8 jliI and 
were overlaid with 10 jal of CHILLOUT liquid wax. The samples were heated to the 

25 reaction temperature of 62°C (this temperature is the optimum temperature for aimeaUng 
of Probe 1 to the first target). The reactions were started by the addition of Pfu FEN-1 
and MgCl2, in a 2 |al volume. After incubation at 62°C for 15 minutes, the temperature 
was lowered to 58°C (this temperature is the optimum temperature for annealing of Probe 
2 to the second target) and the samples were incubated for another 1 5 min. Reactions 
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were stopped by the addition of 10 jil of 95% formamide, 20 mM EDTA, 0.02% methyl 
violet. 

Samples from both the standard and the sequential invasive cleavage reactions 
were heated to 90^ C for 1 min immediately before electrophoresis through a 20% 
5 denaturing acrylamide gel (19:1 cross-linked), with 7 M urea, in a buffer of 45 mM 
Tris-Borate, pH 8.3, 1.4 mM EDTA. The gel was then analyzed with a Molecular 
Dynamics Fluorlmager 595. The resulting image is displayed in Fig. 28a. A graph 
showing measure of fluorescence intensity for each of the product bands is shown in Fig. 
28b. 

^ 10 In Fig. 28a, lanes 1-5 contain the products generated in standard invasive cleavage 

O reactions that contained either no target (lane 1), 10 amol of target (lanes 2 and 3) or with 

f 1 100 amol of target (lanes 4 and 5). The uncleaved probe is seen as a dark band in each 

E lane about half way down the panel and the cleavage products appear as a smaller black 

W band near the bottom of the panel, the position of the cleavage product is indicated by an 

r 15 arrow head to the left of Fig. 28a. The gray ladder of bands seen in lanes 1-5 is due to the 
^"!; thermal degradation of the probe and is not related to the presence or absence of the 

m 

m target DNA. The remaining lanes display products generated in sequential invasive 

fi cleavage reactions that contained 1 amol of target (lanes 6 and 7), 10 amol of target 

^ (lanes 8 and 9) and 100 amol of target (lanes 10 and 1 1). The uncleaved first probe 

20 (Probe 1; labeled "1 uncut") is seen near the top of the panel, while the cleaved first probe 
is indicated as "1; cut". Similarly, the uncleaved and cleaved second probe are indicated 
as "2: uncut" and 2: cut," respectively. 

The graph shown in Fig. 28b compares the amount of product generated from the 
standard reaction ("Series 1") to the amount of product generated from the second step of 
25 the sequential reaction ("Series 2"). The level of background fluorescence measured 

from a reaction that lacked target DNA was subtracted from each measurement. It can be 
seen from the table located below the graph that the signal from the standard invasive 
cleavage assay that contained 100 attomoles of target DNA was nearly identical to the 
signal from the sequential invasive cleavage assay in which 1 attomole of target was 
30 present, indicating that the inclusion of a second cleavage structure increases the 

sensitivity of the assay 100 to 200-fold. This boost in signal allows easy detection of 
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target nucleic acids at the sub attomole level using the sequential invasive cleavage assay, 
while the standard assay, when performed using this enzyme for only 30 minutes, does 
not generate detectable product in the presence of 10 attomoles of target. 

When the amount of target was decreased by 10 or 100 fold in the sequential 
invasive cleavage assay, the intensity of the signal was decreased by the same proportion. 
This indicates that the quantitative capability of the invasive cleavage assay is retained 
even when reactions are performed in series, thus providing a nucleic acid detection 
method that is both sensitive and quantitative. 

While in this Example, the two probes used had different optimal hybridization 
temperatures (z.e., the temperature empiracally determined to give the greatest turnover 
rate in the given reaction conditions), the probes may also be selected (Le,, designed) to 
have the same optimal hybridization temperature so that a temperature shift during 
incubation is not necessary. 

EXAMPLE 12 

The Products Of A Completed Sequential Invasive Cleavage Reaction Cannot Cross 
Contaminate Subsequent Similar Reactions 

As discussed in the Description of the Invention, the serial nature of the multiple 
invasive cleavage events that occur in the sequential invasive cleavage reaction, in 
contrast to the reciprocating nature of the polymerase chain reaction and similar doubling 
assays, means that the sequential invasive cleavage reaction is not subject to 
contamination by the products of like reactions because the products of the first cleavage 
reaction do not participate in the generation of new signal in the second cleavage 
reaction. If a large amount of a completed reaction were to be added to a newly 
assembled reaction, the background that would be produced would come from the 
amount of target that was also carried in, combined with the amount of akeady-cleaved 
probe that was carried in. In this Example, it is demonstrated that a very large portion of 
a primary reaction must be introduced into the secondary reaction to create significant 
signal. 

A first or primary sequential invasive cleavage reaction was performed as 
described above using 100 amol of target DNA. A second set of 5 reactions were 
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assembled as described in Ex. 1 1 with the exception that portions of the first reaction 
were introduced and no additional target DNA was included. These secondary reactions 
were initiated and incubated as described above, and included 0, 0.01, 0.1, 1, or 10% of 
the first reaction material A control reaction including 100 amol of target was included 

5 in the second set also. The reactions were stopped, resolved by electrophoresis and 
visualized as described above, and the resulting image is displayed in Fig. 29. The 
primary probe, uncut second probe and the cut 2nd probe are indicated on the left as "1 : 
cut", 2: uncut" and 2: cut", respectively. 

In Fig. 29, lane 1 shows the results of the first reaction with the accumulated 

10 product at the bottom of the panel, and lane 2 show a 1 : 10 dilution of the same reaction, 
to demonstrate the level of signal that could be expected fi-om that level of contamination, 
without further amphfication. Lanes 3 through 7 show the results of the secondary 
cleavage reactions that contained 0, 10, 1, 0,1 or 0.01% of the first reaction material 
added as contaminant, respectively and lane 8 shows a control reaction that had 100 amol 

15 of target DNA added to verify the activity of the system in the secondary reaction. The 
signal level in lane 4 is as would be expected when 10% of the pre-cleaved material is 
transferred (as in lane 2) and 10% of the transferred target material fi-om the lane 1 
reaction is allowed to fiirther amplify. At all levels of further dilution the signal is not 
readily distinguished fi-om background. These data demonstrate that while a large-scale 

20 transfer from one reaction to another may be detectable, cross contamination by the 

minute quantities that would be expected from aerosol or from equipment contamination 
would not be easily mistaken for a false positive result. These data also demonstrate that 
when the products of one reaction are deliberately carried over into a fresh sample, these 
products do not participate in the new reaction, and thus do not affect the level of target- 

25 dependent signal that may be generated in that reaction. 



247 



EXAMPLE 13 

Detection of Human Cytomegalovirus Viral DNA by Invasive Cleavage 

The previous Example demonstrates the ability of the invasive cleavage reaction 
5 to detect minute quantities of viral DNA in the presence of human genomic DNA. In this 
Example, the probe and INVADER oligonucleotides were designed to target the 
3104-3061 region of the major immediate early gene of human cytomegalovirus 
(HCMV) as shown in Fig. 30. In Fig. 30, the INVADER oligo (89-44; SEQ ID NO:219) 
and the fluorescem (Fl)-labeled probe ohgo (89-76; SEQ ED NO:218) are shown annealed 
10 along a region of the HCMV genome corresponding to nucleotides 3057-31 10 of the viral 
DNA (SEQ ID NO:220). The probe used in this Example is a poly-pyrimidine probe and 
as shown herein the use of a poly-pyrimidine probe reduces background signal generated 
by the thermal breakage of probe oligos. 

The genomic viral DNA was purchased from Advanced Biotechnologies, 
15 Incorporated (Columbia, MD). The DNA was estimated (but not certified) by personnel 
at Advanced Biotechnologies to be at a concentration of 170 amol (1x10^ copies) per 
microliter. The reactions were performed in quadruplicate. Each reaction comprised 1 
[iU 5' fluorescein labeled probe oligonucleotide 89-76 (SEQ ID NO:218), 100 nM 
INVADER oligonucleotide 89-44 (SEQ ID NO:219), 1 ng/ml human genomic DNA, and 
20 one of five concentrations of target HCMV DNA in the amounts indicated above each 
lane in Fig. 31, and 10 ng of Pfu FEN-1 in 10 ^il of 10 mM MOPS (pH 7.5), 6 mM MgCb 
with 0.05% each of Tween 20 and Nonidet P-40. All of the components except the 
labeled probe, enzyme and MgCh were assembled in a final volume of 7 ^il and were 
overlaid with 10 \il of CHILLOUT liquid wax. The samples were heated to 95°C for 5 
25 min, then reduced to 62°C. The reactions were started by the addition of probe, Pfu 
FEN-1 and MgCl2, in a 3 |xl volume. After incubation at 62°C for 60 minutes, the 
reactions were stopped with 10 |xl of 95% formamide, 10 mM EDTA, 0.02% methyl 
violet. Samples were heated to 90° C for 1 min immediately before electrophoresis 
through a 20% acrylamide gel (19:1 cross-Unked), with 7 M urea, in a buffer of 45 mM 
30 Tris-Borate, pH 8.3, 1 .4 mM EDTA. Gels were then analyzed with a Molecular 
Dynamics Fluorlmager 595. 
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The resulting image is displayed in Fig. 3 1 . The repUcate reactions were run in 
groups of four lanes with the target HCMV DNA content of the reactions indicated above 
each set of lanes (0-170 amol). The uncleaved probe is seen in the upper third of the 
panel ("Uncut 89-76") while the cleavage products are seen in the lower two-thirds of the 
5 panel ("Cut 89-76"). It can be seen that the intensity of the accumulated cleavage product 
is proportional to the amount of the target DNA in the reaction. Furthermore, it can be 
clearly seen in reactions that did not contain target DNA ("no target") that the probe is 
not cleaved, even in a background of human genomic DNA. While 10 ng of human 
genomic DNA was included in each of the reactions shown in Fig, 31, inclusion of 
10 genomic DNA up to 200 ng has slight impact on the amount of product accumulated. 
The data did not suggest that 200 ng per 10 lal of reaction mixture represented the 
maximum amount of genomic DNA that could be tolerated without a significant 
reduction in signal accumulation. For reference, this amount of DNA exceeds what 
might be found in 0.2 ml of urine (a commonly tested amount for HCMV in neonates) 
15 and is equivalent to the amount that would be found in about 5 |li1 of whole blood. 

These results demonstrate that the standard (i.e., non-sequential) invasive 
cleavage reaction is a sensitive, specific and reproducible means of detecting viral DNA. 
Detection of 1.7 amol of target is roughly equivalent to detection of 10^ copies of the 
virus. This is equivalent to the number of viral genomes that might be found in 0.2 mis 
20 of urine from a congenitally infected neonate (10^ to 10^ genome equivalents per 0.2 mis; 
Stagno et al, J. Infect. Dis., 132:568 [1975]). Use of the sequential invasive cleavage 
assay would permit detection of even fewer viral DNA molecules, faciUtating detection 
in blood (10^ to 10^ viral particles per ml; Pector et al, J. Clin. Microbiol, 30:2359 
[1992]), which carries a much larger amount of heterologous DNA. 
25 From the above it is clear that the invention provides reagents and methods to 

permit the detection and characterization of nucleic acid sequences and variations in 
nucleic acid sequences. The INVADER-directed cleavage reaction and the sequential 
INVADER-directed cleavage reaction of the present invention provide ideal direct 
detection methods that combine the advantages of the direct detection assays (e.g., easy 
30 quantification and minimal risk of carry-over contamination) with the specificity 
provided by a dual or tri oligonucleotide hybridization assay. 
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As indicated in the Description of the Invention, the use of sequential invasive 
cleavage reactions can present the problem of residual uncut first, or primary, probe 
interacting with the secondary target, and either competing with the cut probe for 
binding, or creating background through low level cleavage of the resulting structure. 
This is shown diagramatically in Figs. 32 and 33. In Fig. 32, the reaction depicted makes 
use of the cleavage product from the first cleavage structure to form an INVADER 
oUgonucleotide for a second cleavage reaction. The structure formed between the 
secondary target, the secondary probe and the uncut primary probe is depicted in Fig. 32, 
as the right hand structure shown in step 2a. This structure is recognized and cleaved by 
the 5' nucleases, albeit very inefficiently (i.e., at less than about 1% in most reaction 
conditions). Nonetheless, the resulting product is indistinguishable from the specific 
product, and thus may lead to a false positive result. The same effect can occur when the 
cleaved primary probe creates and integrated INVADER/target (IT) molecule, as 
described in Example 11; the formation of the undesirable complex is depicted 
schematically in Fig. 33, as the right hand structure shown in step 2a. 

The improvements provided by the inclusion of ARRESTOR ohgonucleotides of 
various compositions in each of these types of sequential INVADER assays are 
demonstrated in the following Examples. These ARRESTOR oligonucleotides are 
configured to bind the residual uncut probe from the first cleavage reaction in the series, 
thereby increasing the efficacy of and reducing the non-specific background in the 
subsequent reaction(s). 

EXAMPLE 14 

"ARRESTOR" Oligonucleotides Improve Sensitivity of Multiple Sequential 

Invasive Cleavage Assays 

In this Example, the effect of including an ARRESTOR oligonucleotide on the 
generation of signal using the IT probe system 33 is demonstrated. The ARRESTOR 
oUgonucleotide hybridizes to the primary probe, mainly in the portion that recognizes the 
target nucleic acid during the first cleavage reaction. In addition to examining the effects 
of adding an ARRESTOR oligonucleotide, the effects of using ARRESTOR 
ohgonucleotides that extended in complementarity different distances into the region of 
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the primary probe that composes the secondary IT structure were also investigated. 
These effects were compared in reactions that included the target DNA over a range of 
concentrations, or that lacked target DNA, in order to demonstrate the level of 
nonspecific (z.e., not related to target nucleic acid) background in each set of reaction 
5 conditions. 

The target DNA for these reactions was a fi-agment that comprised the full length 
of the hepatitis B genome from strain of serotype adw. This material was created using 
the polymerase chain reaction from plasmid pAM6 (ATCC #45020D). The PCRs were 
conducted using a vector-based forward primer, oligo # 156-022-001 (5'- 

10 ggcgaccacacccgtcctgt-3'; SEQ ED NO:594) and a reverse primer, oligo #156-022-02 (5'- 
ccacgatgcgtccggcgtag-3'; SEQ ID NO:595) to amplify the full length of the HBV insert, 
an amplicon of about 3.2kb. The cycUng conditions included a denaturation of the 
plasmid at 95°C for 5 minutes, followed by 30 cycles of 95°C, 30 seconds; 60°C, 40 
seconds; and ll^'C, 4 minutes. This was followed by a final extension at IT'C for 10 

15 minutes. The resulting ampHcon, termed pAM6#2, was adjusted to 2 M NHUOAc, and 
collected by precipitation wiht isopropanol. After drying in vacuo, DNA was dissolved 
in 10 mM Tris pH .0, 0. 1 mM EDTA. The concentration was determined by OD200 
measurement, and by INVADER assay with comparison to a standard of known 
concentration, 

20 The INVADER reactions were conducted as follows. Five master mixes, termed 

"A," "B," "C," "D," and "E," were assembled; all mixes contained 12,5 mM MOPS, pH 
7.5, 500 finoles primary INVADER oUgo #218-55-05 (SEQ ID NO:596), 10 ng human 
genomic DNA (Novagen) and 30 ng AfuFENl enzyme, for every 8 jal of mix. Mix A 
contained no added HBV genomic amplicon DNA; mix B contained 600 molecules of 

25 HBV genomic amplicon DNA pAM6 #2; mix C contained 6,000 molecules pAM6 #2; 
mix D contained 60,000 molecules pAM6 #2; and mix E contained 600,000 molecules 
pAM6 #2. The mixes were aUquotted to the reaction tubes, 8 |il/tube: mix A to tubes 1, 
2, 11, 12, 21 and 22; mix B to tubes 3, 4, 13, 14, 23 and 24; mix C to tubes 5, 6, 15, 16, 
25 and 26; mix D to tubes 7, 8, 17, 18, 27 and 28; and mix E to tubes 9, 10, 19, 20, 29 

30 and 30. The samples were incubated at 95''C for 4 minutes to denature the HBV genomic 
amphcon DNA, The reactions were then cooled to 67''C, and 2 jil of a mix containing 
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37.5 mM MgCl2 and 2,5 pmoles 218-95-06 (SEQ ID NO:597) for every 2 jal, was added 
to each sample. The samples were incubated at 6TC for 60 minutes. Three secondary 
reaction master mixes were prepared, all mixes contained 10 pmoles of secondary probe 
oligonucleotide #228-48-04 (SEQ ID NO:598) for every 2 \x\ of mix. Mix 2A contained 

5 no additional oligonucleotide, mix 2B contained 5pmoles "ARRESTOR" oligo # 2 1 8-95- 
03 (SEQ ID NO:599) and mix 2C contained 5 pmoles of "ARRESTOR" oligo # 218-95- 
01 (SEQ ID NO:600). After the 60 minute incubation at 67^C (the primary reaction 
described above), 2 jal of the secondary reaction mix was added to each sample: Mix 2A 
was added to samples #1-10; Mix 2B was added to samples #1 1-20; and Mix 2C was 

10 added to samples #21-30. The temperature was adjusted to 52°C and the samples were 
incubated for 30 minutes at 52°C. The reactions were then stopped by the addition of 10 
111 of a solution of 95% formamide, 5 mM EDTA and 0.02% crystal violet. All samples 
were heated to 95°C for 2 minutes, and 4 ^1 of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 

15 urea, in a buffer containing 45 mM Tris-Borate (pH8,3) and 1.4 mM EDTA. The results 
were imaged using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 
530. The resulting images are shown in Fig. 34. 

In Fig, 34, Panel A shows the results of the target titration when no ARRESTOR 
oHgonucleotide was included in the secondary reaction; Panel B shows the results of the 

20 same target titration using an ARRESTOR oligonucleotide that extended 2 nt into the 
non-target complementary region of the primary probe; and Panel C shows the results of 
the same target titration using an ARRESTOR oUgonucleotide that extended 4 nt into the 
non-target complementary region of the primary probe. The product of the secondary 
cleavage reaction is seen as a band near the bottom of each panel. The first two lanes of 

25 each panel (/.e„ 1 and 2, 1 1 and 12, 21 and 22) lacked target DNA, and the signal that co- 
migrates with the product band represents the nonspecific background under each set of 
conditions. 

It can be seen by visual inspection of these panels that the background signal is 
both reduced, and made more predictable, by the inclusion of either species of 
30 ARRESTOR oUgonucleotide. In addition to reducing the background in the no-target 
control lanes, the background reduction in the reactions that had the more dilute amounts 
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of target included is reduced, leading to a signal that is a more accurate reflection of the 
target contained within the reaction, thus improving the quantitative range of the 
multiple, sequential invasive cleavage reaction. 

To quantify the impact of including the ARRESTOR oligonucleotide in the 

5 secondary cleavage reaction under these conditions, the average product band signal from 
the reactions having the largest amount of target (i.e., averages of the signals from lanes 9 
and 10, lanes 19 and 20, and lanes 29 and 30), were compared to the averaged signal 
from the no-target contol lanes for each panel, determine the "fold over background," the 
factor of signal amplification over background, under each set of conditions. For the 

10 reactions without the ARRESTOR oligonucleotide, Panel A, the fold over background 
was 5.3; for Panel B, the fold over background was 12.7; and for Panel C, the fold over 
background was 13.4, indicating that in this system inclusion of any ARRESTOR 
oligonucleotide at least doubled the specificity of the signal over the ARRESTOR 
oligonucleotide -less reactions, and that the ARRESTOR oligonucleotide that extended 

1 5 slightly farther into the non-target complementary region may be slightly more effective, 
at least in this embodiment of the system. This clearly shows the benefits of using an 
ARRESTOR oligonucleotide to enhance the specificity of these reactions, an advantage 
that is of particular benefit at low levels of target nucleic acid. 

20 EXAMPLE 15 

"ARRESTOR" Oligonucleotides Allow use of Higher Concentrations of Primary 
Probe Without Increasing Background Signal 

Increasing the concentration of the probe in the invasive cleavage reaction can 
dramatically increase the amount of signal generated for a given amount of target DNA. 

25 While not intending to limit the explanation to any specific mechanism, this is believed to 
be caused by the fact that increased concentration of probe increases the rate at which the 
cleaved probe is supplanted by an uncleaved copy, thereby increasing the apparent 
turnover rate of the cleavage reaction. Unfortunately, this effect could not heretofore be 
applied in the primary cleavage reaction of a multiple sequential INVADER assay 

30 because the residual uncleaved primary probe can hybridize to the secondary target, in 
competition with the cleaved molecules, thereby reducing the efficacy of the secondary 
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reaction. Elevated concentrations of primary probe exacerbate this problem. Further, the 
resulting complexes, as described above, can be cleaved at a low level, contributing to 
background. Therefore, increasing the primary probe can have the double negative effect 
of both slowing the secondary reaction and increasing the level of this form of non target- 

5 specific background. The use of an ARRESTOR oligonucleotide to sequester or 
neutralize the residual primary probe allows this concentration-enhancing effect to be 
applied to these sequential reactions. 

To demonstrate this effect, two sets of reactions were conducted. In the first set 
of reactions, the reactions were conducted using a range of primary probe concentrations, 

10 but no ARRESTOR oligonucleotide was supphed in the secondary reaction. In the 
second set of reactions, the same probe concentrations were used, but an ARRESTOR 
oligonucleotide was added for the secondary reactions. 

All reactions were performed in duplicate. Primary INVADER reactions were 
done in a final volume of 10 [il and contained: 10 mM MOPS, pH 7.5, 7.5 mM MgCb, 

15 500 fin of primary INVADER (218-55-05; SEQ ID NO:596); 30 ng of AfiiFENl enzyme 
and 10 ng of human genomic DNA. 100 zeptomoles of HBV pAM6 #2 amplicon was 
included in all even numbered reactions (by reference to Figs. 35A and B). Reactions 
included 10 pmoles, 20 pmoles, 50 pmoles, 100 pmoles or 150 pmoles of primary probe 
(218-55-02; SEQ ID NO:601). MOPS, target and INVADER oligonucleotides were 

20 combined to a final volume of 7 ^il. Samples were heat denatured at 95°C for 5 minutes, 
then cooled to 67°C. During the 5 minute denaturation, MgCb, probe and enzyme were 
combined. The primary INVADER reactions were initiated by the addition of 3 [il of 
MgCli, probe and enzyme mix, to the final concentrations indicated above. Reactions 
were incubated for 30 minutes at 67°C. The reactions were then cooled to 52°C, and 

25 each primary INVADER reaction received the following secondary reaction components 
in a total volume of 4 2.5 pmoles secondary target (oligo number 218-95-04; SEQ ID 
NO:602); 10 pmoles secondary probe (oligo number 228-48-04; SEQ ID NO:598). The 
reactions that included the ARRESTOR ohgonucleotide had either 40 pmoles, 80 pmoles, 
200 pmoles, 400 pmoles or 600 pmoles of ARRESTOR ohgonucleotide (oligo number 

30 2 1 8-95-0 1 ; SEQ ID NO:600), added at a 4-fold molar excess over the primary probe 
amount for each reaction, with this mix. Reactions were then incubated at 52°C for 30 
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minutes. The reactions were stopped by the addition of 10 jal of a solution of 95% 
formamide, 10 mM EDTA and 0.02% crystal violet. All samples were heated to 95°C 
for 1 minute, and 4 yd of each sample were resolved by electrophoresis through 20% 
denaturing acrylamide gel (19:1 cross-linked) with 7 M urea, in a buffer containing 45 

5 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. The results were imaged using the 
Molecular Dynamics Fluoroimager 595, excitation 488, emission 530. The resulting 
images for the reactions either without or with an ARRESTOR oligonucleotide are shown 
in Figs. 35A and 35B, respectively. The products of cleavage of the secondary probe are 
seen as a band near the bottom of each panel. 

10 In Fig. 35A, lane sets 1 and 2 show results with 10 pmoles of primary probe; 3 

and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 10 had 
150 pmoles. It can be seen by visual examination, that the increases in the amount of 
primary probe have the combined effect of slightly increasing the background in the no- 
target lanes (odd numbers) while reducing the specific signal in the presence of target 

15 (even numbered lanes), and therefore the reducing the specificity of the reaction if 
viewed as the measure of "fold over background," demonstrating that the approach of 
increasing signal by increasing probe cannot be applied in these sequential reactions. 

In Fig. 35B, lane sets 1 and 2 show results with 10 pmoles of primary probe; 
while 3 and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 

20 10 had 150 pmoles. In addition, each reaction included 4- fold molar excess of the 
ARRESTOR ohgonucleotide added before the secondary cleavage reaction. It can be 
seen by visual examination that the background in the no-target lanes (odd numbers) is 
lower in all cases, while the specific signal in the presence of target (even numbered 
lanes) increases with increased amounts of primary probe, leading to a greater "fold over 

25 background" sensitivity at this target level. 

To quantitatively compare these effects, the fluorescence signal from the products 
of both non-specific and specific cleavage were measured. The results are depicted 
graphically in Fig. 35C, graphed as a measure of the percentage of the secondary probe 
cleaved during the reaction, compared to the amount of primary probe used. 

30 Examination of the plots fi-om the no-target reactions confirms that the background in the 
absence of the ARRESTOR oligonucleotide is, in general, roughly two-fold higher, and 
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that both increase slightly with the increasing probe amounts. The specific signals 
however, diverge between the two sets of reaction more dramatically. While the signal in 
the no-ARRESTOR oligonucleotide reactions decreases steadily as primary probe was 
increased, the signal in the ARRESTOR ohgonucleotide reactions continued to increase. 
5 At the highest primary probe concentrations tested, the no-ARRESTOR oligonucleotide 
reactions had specific signal that was only 1.7 fold over background, while the 
ARRESTOR oligonucleotide reactions detected the 100 zmoles (60,000 copies) of target 
with a signal 6.5 fold over background, thus demonstrating the improvement in the 
sequential invasive cleavage reaction when an ARRESTOR oligonucleotide is included. 

10 

EXAMPLE 16 

Modified Backbones Improve Performance of ARRESTOR Oligonucleotides All 
Natural "ARRESTOR'^ Oligo With No 3'-Amine 

The reactions described in the previous two Examples used ARRESTOR 
15 oUgonucleotides that were constructed using 2' 0-methyl ribose backbone, and that 
included a positively charged amine group on the 3' terminal nucleotide. The 
modifications were made specifically to reduce enzyme interaction with the primary 
probe/ARRESTOR ohgonucleotide complex. During the development of the present 
invention, it was determined that the 2' O-methy modified oligonucleotides are somewhat 
20 resistant to cleavage by the 5' nucleases, just as they are slowly degraded by nucleases 
when used in antisense applications {See e,g,, Kawasaki et al, J. Med. Chem., 36:831 
[1993]). 

The presence of an amino group on the 3' end of an ohgonucleotide reduces its 
abihty to direct invasive cleavage. To reduce the possibility that the ARRESTOR 
25 oligonucleotide would form a cleavage structure in this way, an amino group was 
included in the design of the experiments described in this and other Examples. 

Initial designs of the ARRESTOR oUgonucleotides (sometimes referred to as 
"blockers") did not include these modifications, and these molecules were found to 
provide no benefit in reducing background cleavage in the sequential invasive cleavage 
30 assay and, in fact, sometimes contributed to background by inducing cleavage at an 
unanticipated site, presumably by providing some element to an alternative cleavage 
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structure. The effects of natural and modified ARRESTOR oligonucleotides on the 
background noise in these reactions are examined in this Example, 

The efficacy of an "all-natural ARRESTOR oligonucleotide (ie,, an ARRESTOR 
oligonucleotide that did not contain any base analogs or modifications) was examined by 

5 comparison to an identical reactions that lacked ARRESTOR oligonucleotide. All 
reactions were performed in duplicate, and were conducted as follows. Two master 
mixes were assembled, each containing 12,5 mM MOPS, pH 7.5, 500 finoles primary 
INVADER oligonucleotide #218-55-05 (SEQ ID NO:596), 10 ng human genomic DNA 
(Novagen) and 30 ng AfuFENl enzyme for every 8 \i\ of mix. Mix A contained no 

10 added HB V genomic amplicon DNA, mix B contained 600,000 molecules of HB V 

genomic amplicon DNA, pAM6 #2. The mixes were distributed to the reaction tubes, in 
aliquots of 8 p,l/tube as follows: mix A to tubes 1, 2, 5 and 6; and mix B to tubes 3, 4, 7 
and 8. The samples were incubated at 95°C for 4 minutes to denature the HBV genomic 
ampUcon DNA. The reactions were then cooled to 67''C and 2ul of a mix containing 37.5 

15 mM MgCl2 and 10 pmoles 218-55-02B (SEQ ID NO:603) for every 2 |Ltl, was added to 
each sample. The samples were then incubated at 67''C for 30 minutes. Two secondary 
reaction master mixes were prepared, each containing 10 pmoles of secondary probe 
oligo #228-48-04N (SEQ ID NO:604) and 2.5 pmoles of secondary target oUgonucleotide 
#218-95-04 (SEQ ED NO:602) for every 3 ^1 of mix. Mix 2A contained no additional 

20 oligonucleotide, while mix 2B contained 50 pmoles of the natural "ARRESTOR" 

ohgonucleotide #241-62-02 (SEQ ID NO:605). After the initial 30 minute incubation at 
67°C, the temperature was adjusted to 52^C, and 3 |il of a secondary reaction mix was 
added to each sample, as follows: Mix 2A was added to samples #1-4; and Mix 2B was 
added to samples #5-8. The samples were then incubated for 30 minutes at 52°C. The 

25 reactions were then stopped by the addition of 10 |il of a solution of 95% formamide, 10 
mM EDTA and 0.02% crystal violet. 

All of the samples were heated to 95 for 2 minutes, and 4 jal of each sample 
were resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross- 
linked) with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1.4 mM 
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EDTA. The results were imaged using the Molecular Dynamics Fluoroimager 595, 
excitation 488, emission 530. The resulting image is shown in Fig. 36A. 

To compare the effects of the various modifications made to the ARRESTOR 
oligonucleotides, reactions were performed using ARRESTOR ohgonucleotides having 

5 all natural bases, but including a 3' terminal amine; ARRESTOR oligonucleotides having 
a 3' portion composed of 2' 0-methyl nucleotides, plus the 3' terminal amine; and 
ARRESTOR ohgonucleotides composed entirely of 2' 0-methyl nucleotides, plus the 3' 
terminal amine. These were compared to reactions performed vdthout an ARRESTOR 
oligonucleotide. The reactions were conducted as follows. Two master mixes were 

10 assembled, all mixes contained 14.3 mM MOPS, pH 7.5, 500 finoles primary INVADER 
oUgo #218-55-05 (SEQ ID NO:596) and 10 ng human genomic DNA (Novagen) for 
every 7 ^il of mix. Mix A contained no added HBV genomic amphcon DNA, mix B 
contained 600,000 molecules of HBV genomic amplicon DNA, pAM6 #2. The mixes 
were distributed to the reaction tubes, at 7 |al/tube: mix A to tubes 1, 2, 5, 6, 9, 10, 13 and 

15 14; and mix B to tubes 3, 4, 7, 8, 11, 12, 15 and 16. The samples were warmed to 95°C 
for 4 minutes to denature the HBV DNA. The reactions were then cooled to 67°C and 3 
^1 of a mix containing 25 mM MgCh, 25 pmoles 218-55-02B (SEQ ID NO:603) and 30 
ng AfuFENl enzyme per 3 |j.1, were added to each sample. The samples were then 
incubated at 67°C for 30 minutes. Four secondary reaction master mixes were prepared; 

20 all mixes contained 10 pmoles of secondary probe oUgonucleotide #228-48-046 (SEQ ID 
NO:606) and 2.5 pmoles of secondary target oUgonucleotide #218-95-04 (SEQ ID 
NO:602) for every 3 |j,l of mix. Mix 2A contained no additional oligonucleotide, while 
mix 2B contained 100 pmoles of the natural+amine ARRESTOR oUgonucleotide # 241- 
62-01 (SEQ ID NO:607), mix 2C contained 100 pmoles of partially 0-methyl+amine 

25 oUgonucleotide # 241-62-03 (SEQ ID NO:608) and mix 2D contained 100 pmoles of aU 
O-methyl+amine oligonucleotide #241-64-01 (SEQ ID NO:609). After the initial 30 
minute incubation at 67°C, the temperature was adjusted to 52°C and 3 |xl of a secondary 
reaction mix was added to each sample, as follows: mix 2A was added to samples #1-4; 
mix 2B was added to samples #5-8; mix 2C was added to samples #9-12; and mix 2D 

30 was added to samples #1 3-16. The samples were incubated for 30 minutes at 52°C, then 
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stopped by the addition of 10 of a solution of 95% formamide, 10 mM NaEDTA, and 
0.2% crystal violet. 

All samples were heated to 95 °C for 2 minutes, and 4 \il of each sample were 
resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked) 

5 with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1 .4 mM EDTA. 
The results were imaged using the Molecular Dynamics Fluoroimager 595, excitation 
488, emission 530. The resulting image is shown in Fig. 36B. 

In Fig. 36A, the left-hand panel shows the reactions that lacked an ARRESTOR 
oligonucleotide, while the right hand panel shows the data from reactions that included 

10 the all natural ARRESTOR oHgonucleotide. The first two lanes of each panel are from 
no-target controls, the second set of lanes contained target. The products of cleavage are 
visible in the bottom one/fourth of each panel. The position at which the specific reaction 
products should run is indicated by arrows on left and right. 

It can be seen by examination of these data, that the reactions run in the absence 

15 of ARRESTOR oligonucleotide show reproducible quaUty between the rephcates, and 
show significant cleavage only when target is present. In contrast, the addition of another 
unmodified oligonucleotide into the reactions causes great variation between the replicate 
lanes (e,g., lanes 5 and 6 were provided with the same reactants, but produced markedly 
different results). The introduction of the all-natural ARRESTOR oHgonucleotide 

20 produced, rather than reduced, background in these no-target lanes, and increased 

cleavage at other sites (Le., the bands other that those indicated by the arrows flanking the 
panels). For these reasons the modifications that are described above, the effects of 
which are shown on Fig. 36B, were incorporated. 

The first 4 lanes of Fig. 36B show the products of duphcate reactions without an 

25 ARRESTOR oligonucleotide, plus or minus the HBV target (lanes 1, 2, and lanes 3, 4, 
respectively); The next 4 lanes, 5, 6 and 7, 8 used a natural ARRESTOR oHgonucleotide 
having a 3' terminal amine; lanes 9, 10 and 11, 12 used the ARRESTOR oligonucleotide 
with a 3' portion composed of 2' 0-methyl nucleotides, and having a 3* terminal amine; 
lanes 13, 14 and 15, 16 used the ARRESTOR oligonucleotide composed entirely of 2* O- 

30 methyl nucleotides and having a 3' terminal amine. The products of cleavage of the 
secondary probe are visible in the lower one third of each panel. 
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Visual inspection of these data shows that the addition of the 3' terminal amine to 
the natural ARRESTOR oligonucleotide suppresses the aberrant cleavage seen in Fig. 
36A, but this ARRESTOR oligonucleotide does not improve the performance of the 
reaction, as compared to the no-ARRESTOR oUgonucleotide controls. In contrast, the 
use of the 2' 0-methyl nucleotides in the body of the ARRESTOR oligonucleotide does 
reduce background, whether partially or completely substituted. To quantify the relative 
effects of these modifications, the fluorescence from each of the co-migrating product 
bands was measured, the signals from the duplicate lanes were averaged and the "fold 
over background" was calculated for each reaction containing target nucleic acid. 

When ARRESTOR oligonucleotide was omitted, the target-specific signal (lanes 
3, 4) was 27-fold over the no target background; the natural ARRESTOR oligonucleotide 
+amine gave a signal of 17-fold over background; the partial 2' 0-methyl+ amine gave a 
signal of 47-fold over background; and the completely T O-methyR amine gave a signal 
of 3 3 fold over background. 

These Figures show that both modifications can have a beneficial effect on the 
specificity of the multiple, sequential invasive cleavage assay. They also show that the 
use of the 2' O-methyl substituted backbone, either partial or entire, markedly improves 
the specificity of these reactions. It is intended that, in various embodiments of the 
present inventon, any number of modifications that make either the ARRESTOR 
oligonucleotide or the complex it forms with the primary target resistant to nucleases will 
provide similar enhancement. 

EXAMPLE 17 

Effect of ARRESTOR Oligonucleotide Length on Signal Enhancement in Multiple 

Sequential Invasive Cleavage Assays 
As noted in the Description of the Invention, the optimal length for an 
ARRESTOR oligonucleotide depends upon the design of the other nucleic acid elements 
of the INVADER reaction, particularly on the design of the primary probe, hi this 
Example, the effects of varying the length of the ARRESTOR oligonucleotide were 
explored in systems using two different secondary probes. A schematic diagram showing 
these ARRESTOR oligonucleotides aligned as they would hybridize to the primary probe 
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oligonucleotide is provided in Fig. 37C. In this Figure, the region of the primary probe 
that recognizes the target nucleic acid is shown underlined; the non-underlined portion, 
plus the first underlined base is the portion that is released by the first cleavage, and goes 
on to participate in the second or subsequent cleavage structure. 

5 All reactions were performed in duplicate. The INVADER reactions were done 

in a final volume of 10 jal final volume containing 10 mM MOPS, pH 7.5, mM MgCli, 
500 finoles of primary INVADER 241-95-01, (SEQ ID NO:610), 25 pmoles of primary 
probe 241-95-02 (SEQ ID N0:61 1), 30 ng of AfuFENl enzyme, and 10 ng of human 
genomic DNA, and if included, 1 amoles of HBV amplicon pAM 6 #2. MOPS, target 

10 DNA, and INVADER oligonucleotides were combined to a final volume of 7 |iL 

Samples were heat denatured at 95*^0 for 5 minutes, then cooled to 67°C. During the 5 
minute denaturation, MgCb, probe and enzyme were combined. The primary INVADER 
reactions were initiated by the addition of 3 )il of MgCb, probe and enzyme mix, to the 
final concentrations indicated above. Reactions were incubated for 30 minutes at 67''C. 

15 The reaction were then cooled to 52°C, and each primary INVADER reaction received 
the following secondary reaction components in a total volume of 3 |li1: 2.5 pmoles 
secondary target 241-95-07 (SEQ ID NO:612), 10 pmoles of either secondary probe 228- 
48-04 (SEQ ID NO:598), or 228-48-04N (SEQ ID NO:604) and 100 pmoles of an 
ARRESTOR oUgonucleotide, either 241-95-03 (SEQ ID NO:613), 241-95-04 (SEQ ID 

20 NO:614), 241-95-05 (SEQ ED NO:615) or 241-95-06 (SEQ ID NO:616). The 
ARRESTOR oligonucleotides were omitted fi"om some reactions as controls for 
ARRESTOR oligonucleotide effects. 

The reactions were incubated at 52''C for 34 minutes, and were then stopped by 
the addition of 10 ^il of 95% formamide, 10 mM EDTA, and 0.02% crystal violet. All 

25 samples were heated to 95''C for 1 minute, and 4 jxl of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 
urea, in a buffer containing 45 mM Tris-Borate (pH83) and 1.4 mM EDTA. The resuUs 
were imaged using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 
530. The resulting images for the reactions with the shorter and longer secondary probes 

30 are shown in Figs. 37A and 37B, respectively. 
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In each Figure, the products of cleavage are visible as bands in the bottom half of 
each lane. The first 4 lanes of each Figure show the products of duplicate reactions 
without an ARRESTOR oligonucleotide, plus or minus the HBV target (lanes sets 1 and 
2 respectively); in the next 4 lanes, sets 3 and 4 used the shortest ARRESTOR 241-95-03 

5 (SEQ ID NO:613); lanes 5 and 6 used 241-95-04 (SEQ ID NO:614); lanes 7 and 8 used 
241-95-05 (SEQ ID NO:615); and lanes 9 and 10 used 241-95-06 (SEQ ID NO:616). 

The principal background of concern is the band that appears in the "no target" 
control lanes (odd numbers; this band co-migrates with the target-specific signal near the 
bottom of each gel panel). Visual inspection shows that the shortest ARRESTOR 

10 oligonucleotide was the least effective at suppressing this background, and that the 

efficacy was increased when the ARRESTOR oligonucleotide extended further into the 
portion that participates in the subsequent cleavage reaction. Even with this difference in 
effect, it can be seen firom these data that there is much latitude in the design of the 
ARRESTOR oligonucleotide. The choice of lengths will be influenced by the 

15 temperature at which the reaction making use of the ARRESTOR oligonucleotide is 

performed, the lengths of the duplexes formed between the primary probe and the target, 
the primary probe and the secondary target, and the relative concentrations of the 
different nucleic acid species in the reactions. 

20 EXAMPLE 18 

Effect of ARRESTOR Oligonucleotide Concentration on Signal Enhancement in 
Multiple Sequential Invasive Cleavage Assays 
In examining the effects of including ARRESTOR oligonucleotides in these 
cleavage reactions, it was of interest to determine if the concentration of the ARRESTOR 
25 oligonucleotide in excess of the primary probe concentration would have an effect on 
yields of either non-specific or specific signal, and if the length of the ARRESTOR 
oHgonucleotide would be a factor. These two variables were investigated in the 
following Example. 

All reactions were performed in dupUcate. The primary INVADER reactions 
30 were done in a final volume of 10 \i\ and contained 10 mM MOPS, pH 7.5; 7.5 mM 
MgCl2, 500 fmoles of primary INVADER 241-95-01 (SEQ ID NO:610), 25 pmoles of 
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primary probe 241-95-02 (SEQ ID N0:611), 30 ng of AfuFENl enzyme, and 10 ng of 
human genomic DNA. Where included, the target DNA was 1 amole of HBV amplicon 
pAM 6 #2, as described above. MOPS, target and INVADER were combined to a final 
volume of 7 p,!. The samples were heat denatured at 95°C for 5 minutes, then cooled to 
5 67°C. During the 5 minute denaturation, MgCla, probe and enzyme were combined. The 
primary INVADER reactions were initiated by the addition of 3 \il of MgCl2, probe and 
enzyme mix. The reactions were incubated for 30 minutes at 67°C. The reactions were 
then cooled to Sl'^C and each primary INVADER reaction received the following 
secondary reaction components: 2.5 pmoles secondary target 241-95-07 (SEQ ID 
10 NO:612), 10 pmoles secondary probe 228-48-04 (SEQ ID NO:598); and, if included, 50, 
5 100 or 200 pmoles of either ARRESTOR oligonucleotide 241-95-03 (SEQ ID NO:613) 

ffl or 241-95-05 (SEQ ID NO:615), in a total volume of 3 \xl Reactions were then 

I" J n 

i incubated at 52°C for 35 minutes. Reactions were stopped by the addition of 10 of 

fl 95% formamide, 1 0 mM EDTA, and 0.02% crystal violet. All of the samples were 

15 heated to 95''C for 1 minute, and 4 |^1 of each sample were resolved by electrophoresis 
p through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M urea, in a buffer 

% ; containing 45 mM Tris-Borate (pH8.3), and 1.4 mM EDTA. The results were imaged 

^ using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 530. The 

|i resulting images are shown as a composite image in Fig. 38. 

20 Each of the duplicate reactions were loaded on the gel in adjacent lanes and are 

labeled with a single lane number. All odd numbered lanes were no-target controls. 
Lanes 1 and 2 had no ARRESTOR oUgonucleotide added; lanes 3-8 show results from 
reactions containing the shorter ARRESTOR oUgonucleotide, 241-95-03 (SEQ ID 
NO:613); lanes 9-14 show results from reactions containing the longer ARRESTOR 
25 oligonucleotide, 241-95-05 (SEQ ID NO:615). The products of cleavage from the 

secondary reaction are visible in the bottom one third of each panel. Visual inspection of 
these data (Le,, comparison of the specific products to the background bands) shows that 
both ARRESTOR oligonucleotides have some beneficial effect at all concentration. 
To quantify the relative effects of ARRESTOR oUgonucleotide length and 
30 concentration, the fluorescence from each of the co-migrating product bands was 
measured, the signals from the duplicate lanes were averaged and the "fold over 
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background" (signal+target/signal-target) was calculated for each reaction containing 
target nucleic acid. The reaction lacking an ARRESTOR oligonucleotide yielded a signal 
approximately 27-fold over background. Inclusion of the shorter ARRESTOR 
oligonucleotide at 50, 100 or 200 pmoles produced products at 42, 51 and 60-fold over 
5 background, respectively. This shows that while the short ARRESTOR at the lowest 
concentration seems to be less effective than the longer ARRESTOR oligonucleotides 
(See, e.g., previous Example) this can be compensated for by increasing the concentration 
of ARRESTOR ohgonucleotide, and thereby the ARRESTOR oligonucleotide :primary 
probe ratio. 

10 In contrast, inclusion of the longer ARRESTOR oligonucleotide at 50, 1 00 or 200 

pmoles produced products at 60, 32 and 24 fold over background, respectively. At the 
lowest concentration, the efficacy of this longer ARRESTOR oligonucleotide relative to 
the shorter ARRESTOR oligonucleotide is consistent with the previous Example. 
Increasing the concentration, however, decreased the yield of specific product, suggesting 

15 a competition effect with some element of the secondary cleavage reaction. 

These data show that the ARRESTOR oligonucleotides can be used to advantage 
in a number of specific reaction designs. The choice of concentration will be influenced 
by the temperature at which the reaction making use of the ARRESTOR oligonucleotide 
is performed, the lengths of the duplexes formed between the primary probe and the 

20 target, the primary probe and the secondary target, and between the primary probe and 
the ARRESTOR ohgonucleotide. 

Selection of ohgonucleotides for target nucleic acids other than the HBV shown 
here, (e.g,, ohgonucleotide composition and length), and the optimization of cleavage 
reaction conditions in accord with the models provided here follow routine methods and 

25 common practice well known to those skilled in the methods of molecular biology. 

Example 10 demonstrated that some enzymes require an overlap between an 
upstream INVADER oligonucleotide and a downstream probe oligonucleotide to create a 
cleavage structure (Figure 27). It has also been determined that the 3' terminal nucleotide 
of the INVADER ohgonucleotide need not be complementary to the target strand, even if 

30 it is the only overlapping base in the INVADER ohgonucleotide (e.g., as with the HCMV 
probes shown in Figure 30). The requirement for an overlap can serve as a convenient 
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basis for detecting single base polymorphisms (SNPs) or mutations in a nucleic acid 
sample. 



a probe and an INVADER oligonucleotide) hybridize in tandem to the target nucleic acid 
5 to form the overlapping structure recognized by the CLEAVASE enzyme to be used in 
the reaction. An unpaired "flap" is included on the 5' end of the probe. The enzyme 
recognizes the overlap and cleaves off the unpaired flap, releasing it as a target-specific 
product. Enzymes that have a strong preference for an overlapping structure, i.e., that 
cleave the overlapping structure at a much greater rate than they cleave a non- 
10 overlapping structure include the FEN-1 enzymes from Archaeoglobus fulgidus and 
Pyrococcus furiosus and such enzymes are particularly preferred in the detection of 
mutations and SNPs. . 



In some embodiments, the present invention provides kits comprising one or more 
of the components necessary for practicing the present invention. For example, the 
present invention provides kits for storing or delivering the enzymes of the present 
invention and/or the reaction components necessary to practice a cleavage assay (e.g., the 
20 INVADER assay). By way of example, and not intending to Umit the kits of the present 
invention to any particular configuration or combination of components, the following 
section describes one embodiment of a kit for practicing the present invention: 

In some embodiments, the kits of the present invention provide the following reagents: 



For detection of single base variations, at least two oligonucleotides (e.g., 



EXAMPLE 19 



15 



Kits for performing the mRNA INVADER Assay 



25 



CLEAVASE enzyme (e.g., 
TthAKK) 



Primary OUgos 



RNA Primary Buffer 1 
RNA Secondary Buffer 1 
tRNA Carrier [20 ng/|il] 



Secondary Oligos 

RNA Standard [lOOamoyjal] 

lOX Cell Lysis Buffer 1 



Tioeo.i Buffer [lOmM Tris*HCl, 



pH 8, O.lmMEDTA] 
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Examples of Primary Oligonucleotides and Secondary Oligonucleotides suitable 
for use with the methods of the present invention are provided in Figure 4L While the 
oligonucleotides shown therein may find use in a number of the methods, and variations 
of the methods, of the present invention, these INVADER assay oligonucleotide sets find 

5 particular use with kits of the present invention. The oHgonucleotide sets shown in 
Figure 41 may be used as individual sets to detect individual target RNAs, or may be 
comined in biplex or multiplex reactions for the detection of two or more analytes or 
controls in a single reaction. It is contemplated that the designs of these probes sets (e.g., 
the oligonucleotides and/or their sequences) may be adapted for use in DNA detection 

10 assays, using the guideUnes for reaction design and optimization provided herein. 
Additional oligonucleotides that find use in detection assays and kits of the present 
invention are provided in Figure 47. 

In some embodiments, a kit of the present invention provides a list of additional 
components (e.g., reagents, supplies, and/or equipment) to be supplied by a user in order 

15 to perform the methods of the invention. For example, and without intending to limit 

such additional components Usts to any particular components, one embodiment of such a 
list comprises the following: 

■ RNase-free (e.g., DEPC-treated) H2O 

■ Clear CHILLOUT-14 Uquid wax (MJ Research) or RNase-firee, optical grade mineral 
20 oil (Sigma, Cat. No. M-5904) 

■ Phosphate-buffered saline (no MgCb, no CaCb) 

■ 96-well polypropylene microplate (MJ Research, Cat, No. MSP-9601) 

■ 0.2-ml thin-wall tubes 

■ Thermaseal well tape (e.g., GeneMate, Cat. No. T-2417-5) 
25 ■ Multichannel pipets (0.5-10^1, 2.5-20^1, 20-200)111) 

■ Thermal cycler or other heat source (e.g., lab oven or heating block). 

■ Fluorescence microplate reader (a preferred plate reader is top-reading, equipped with 
Ught filters have the following characteristics: 
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Excitation 



Emission 



(Wavelength/Bandwidth) 



(Wavelength/Bandwidth) 



485 nm/20 nm 



530 nmy25 nm 



560 nna/20 nm 



620 nm/40 nm 



In some embodiments, a kit of the present invention provides a list of optional 
components (e.g., reagents, supplies, and/or equipment) to be supplied by a user to 
facilitate performance of the methods of the invention. For example, and v^ithout 
5 intending to limit such optional components lists to any particular components, one 
embodiment of such a hst comprises the following: 

■ tRNA Solution, 20ng/|Lil (Sigma, R-5636) 

" IX Stop Solution (lOmM Tris*HCl, pH 8, lOmM EDTA) 

■ Black opaque, 96-well microplate (e.g., COSTAR, Cat. No. 3915) 
10 " Electronic repeat pipet (250ial) 

In some embodiments of a kit, detailed protocols are provided. In preferred 
embodiments, protocols for the assembly of INVADER assay reactions (e.g., 
formulations and preferred procedures for making reaction mixtures) are provided. In 

15 particularly preferred embodiments, protocols for assembly of reaction mixtures include 
computational or graphical aids to reduce risk of error in the performance of the methods 
of the present invention (e.g., tables to facilitate calculation of volumes of reagents 
needed for multiple reactions, and plate-layout guides to assist in configuring multi-well 
assay plates to contain numerous assay reactions). By way of example, and without 

20 intending to limit such protocols to any particular content or format, kits of the present 
invention may comprise the following protocol: 

1 . DETAILED mRNA INVADER ASSAY PROTOCOL 

I. Plan the microplate layout for each experimental run. An example microplate layout 
25 for 40 samples, 6 standards, and a No Target Control is shown in Fig. 40. Inclusion of 
a No Target Control (tRNA Carrier or IX Cell Lysis Buffer 1) and quantitation 
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standards are required for absolute quantitation. 

Prepare the Primary Reaction Mix for either the single or biplex assay format. To 
calculate the volumes of reaction components needed for the assay (X Volume), 
multiply the number of reactions (for both samples and controls) by 1.25 [X Volume 
()j.l) = # reactions x 1.25] . Vortex the Primary Reaction Mix briefly after the last 
reagent addition to mix thoroughly. Aliquot 5^1 of the Primary Reaction Mix per 
microplate well (an electronic repeat pipet is recommended for this step). 

Primary Reaction Mix 



Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0ial 




Primary Oligos 


0.25)al 




Tioeo.i Buffer 


0.25|il 




CLEAVASE enz. enzyme 


0.5^1 




Total Mix Volume (IX) 


S.Ofxl 




Biplex Assay Format 


Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0^il 




Primary Oligos 


0.25 nl 




Housekeeping Primary Oligos 


0.25nl 




CLEAVASE enzyme 


0.5ixl 




Total Mix Volume (IX) 


5.0^1 





Add 5^x1 of each No Target Control, standard, or sample (total RNA or cell lysate) to 
the appropriate well and mix by pipetting up and down 1-2 times. Overlay each 
reaction with I0\i\ of clear CHILLOUT or mineral oil. Seal microplate with 
Thermaseal well tape. 

Incubate reactions for 90 minutes at 60°C in a thermal cycler or oven. 
While the primary reaction is incubating, prepare the Secondary FRET Reaction Mix 
for the single or biplex format. Calculate the component volumes required (X 
Volume) by multiplying the number of reactions (for both samples and controls) by 
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1.25 [X Volume (\xl) == # reactions x L25 Aliquot the Secondary FRET 
Reaction Mix into multiple 0.2-ml thin-wall tubes or an 8-well strip (70jxl/tube is 
sufficient for a row of 12 reactions). 



Secondary FRET Reaction Mix 
Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


RNA Secondary Buffer 1 


2.0^1 




Secondary Oligos 


1.5^1 




TioCoi Buffer 






Total Mix Volume (IX) 


5.0^1 




Biplex Assay Format 


Reaction Components 


IX Volume 


X Volume 


RNA Secondary Buffer 1 


2.0ial 




Secondary Oligos 


1.5^1 




Housekeeping Secondary 


1.5^1 




Total Mix Volume (IX) 


S.O^il 





After the primary reaction incubation is completed, remove the microplate seal, and 
add 5)al Secondary FRET Reaction Mix per well using a multichannel pipet. Mix by 
pipetting up and down 1-2 times. Reseal the microplate with the well tape and 
incubate the microplate at 60*^0 for 60 or 90 minutes, as indicated in each Product 
Information Sheet. The secondary reaction incubation time can be varied. See 
sections 2 of the PROCEDURAL NOTES FOR OPERATION OF THE mRNA 
INVADER ASSAY for details. 

Reactions can be read using one of two procedures: Direct Read or Stop and 
Transfer . 

NOTE: Remove the microplate seal before reading the microplate. 
Direct Read Procedure 
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■ This procedure enables collection of multiple data sets to extend the assay's 
dynamic range. During the secondary INVADER reaction, read the microplate 
directly in a top-reading fluorescence microplate reader. 
■ Recommended settings for a PerSeptive Biosystem Cytofluor 4000 instrument are as 
5 follows: 



Specific Gene Signal: 


Housekeeping Gene Signal: 


Excitation: 


485/20mn 


Excitation: 


560/20mn 


Emission: 


530/25mn 


Emission: 


620/40nm 


ReadsAVell: 


10 


ReadsAVell: 


10 


Gain: 


40 


Gain: 


45 


Temperature: 


25°C 


Temperature: 


25°C 



NOTE: Because the optimal gain setting can vary between instruments, adjust 
the gain as needed to give the best signal/background ratio (sample raw signal 
divided by the No Target Control signal) or No Target Control sample readings of 
15 ---lOO RFUs. Fluorescence microplate readers that use a xenon lamp source 

generally produce higher RFUs. For directly reading the microplates, the probe 
height of, and how the plate is positioned in, the fluorescence microplate reader 
may need to be adjusted according to the manufacturer's recommendations. 

Stop and Transfer Procedure 
20 L Prepare IX Stop Solution (lOmM Tris^HCl, pH 8, lOmM EDTA) with 

RNase-free H2O. Add 1 OOjliI per well with a multichannel pipet. 

2. Transfer lOOjil of the diluted reactions to a black microplate (e.g., 
COSTAR (Coming), Cat. No. 3915), 

3. Read the microplate using the same parameters as the Direct Read 

25 Procedure, but adjust the gain to give No Target Control sample readings of -100 

RFUs (see NOTE above). 



In some embodiments, supplemenatary documentation, such as protocols for 
ancillary procedures, e.g., for the preparation of additional reagents, or for preparation of 
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samples for use in the methods of the present invention, are provided. In preferred 
embodiments, supplementary documentation includes guidelines and hsts of precautions 
provided to facilitate successful use of the methods and kits by unskilled or inexperienced 
users. In particularly preferred embodiments, supplementary documentation includes a 

5 troubleshooting guide, e,g. , a guide describing possible problems that may be 

encountered by users, and providing suggested solutions or corrections to intended to aid 
the user in resolving or avoiding such problems. 

For example, and without intending to limit such supplementary documentation to 
any particular content, kits of the present invention may comprise any of the following 

10 procedures and guidelines: 

II. AVOIDANCE OF RNase CONTAMINATION 

To avoid RNase contamination during sample preparation and testing, in one 
embodiment, the user is cautioned to observe the following precautions: 
15 • Wear disposable gloves at all times to avoid contact with samples and reagents. 

• Use certified RNase-free disposables, including thin-wall polypropylene tubes and 
aerosol-barrier pipet tips, for preparing samples and assay reagents, to avoid cross- 
contamination. 

• Use RNase-free (DEPC-treated) H2O for diluting samples and/or reagents. 
20 • Keep RNA samples and controls on ice during assay setup. 

III. SAMPLE AND CONTROL PEUEPARATION 

NOTE: Dilute both standards and samples to concentrations that correspond to a 
5-|al addition per reaction. 
25 Example 1: The concentration of a 5-attomole standard is lamol/j^l. lamol = 10' 

mole - 602,000 molecules. 
Example 2: The concentration of a 100-ng sample should be 20ng/^l. 
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A. Control Preparation 

No Target Control : 

Total RNA Format: tRNA Carrier (20ng/^l) 

Cell Lysate Format: IX Cell Lysis Buffer 1 (dilute lOX Cell Lysis Buffer 1 to IX 
5 with RNase-free H2O) 

Positive Control : RNA Standard (Std) (lOOamol/^il in vitro transcript) 

1 . Prepare RNA standards by diluting the positive controls with tRNA Carrier (when 
running total RNA samples) or with IX Cell Lysis Buffer 1 [lOX Cell Lysis Buffer 

10 1 diluted with RNase-free H2O] (when running cell lysate samples). The Product 

Information Sheet included in each kit indicates the recommended standard test 
levels and preparation methods. 

2. Using a fresh set of standards for each run is recommended. Store the standards on 
ice during reaction setup, 

15 

B. Total RNA Sample Preparation 

1 . Prepare total RNA from cells or tissue according to manufacturer's instructions for 
the selected preparation method. Recommended methods include TRIZOL (Life 
Technologies, RockviUe, MD), RNEASY (Qiagen, Valencia, CA), and RNA WIZ 

20 (Ambion, Austin, TX). 

2. Dilute total RNA samples with RNase-free H2O to the appropriate concentration. 

C. Cell Lysate Sample Preparation - 96-well microplate format 
NOTE: This cell lysate detection format is used for adherent cells cultured in 96-well 
25 tissue culture microplates. Cells are typically seeded at 10,000-40,000 cells per well. 
Different seeding densities may be required depending on cell type and/or mRNA 
expression levels. See Procedural Notes for more details. For cells exhibiting high 
expression, the following methods can be used to attenuate the signal from the cell 
ly sates: 
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■ plate fewer cells per well; 

■ dilute the cell lysates with IX Cell Lysis Buffer 1 before addition to the 
reaction (e.g., LS^d lysate + 2.5 jil IX Cell Lysis Buffer 1); 

■ read the reaction microplate 1 5-30 minutes after addition of the 

5 Secondary FRET Reaction Mix instead of the recommended 60-90 

minutes; 

L Dilute lOX Cell Lysis Buffer 1 to a IX concentration with RNase-free H2O. 

2. Using a multichannel pipet, carefully remove the culture medium from the wells of 
1 0 adherent cells without disturbing the cell monolayer. 

3. Wash the cells once with 200ili1 PBS (no MgCli, no CaCb) and carefully remove the 
residual PBS with the multichannel pipet. 

4. Add 40|Lil IX Cell Lysis Buffer 1 per well Lyse cells at room temperature for 3-5 
minutes. 

15 5. Using a multichannel pipet, carefully transfer 25|iil of each lysate sample into a 96- 
well microplate. Avoid transferring cellular material from the bottom of the well. 

6. Overlay each lysate sample with lOjil clear CHILLOUT or mineral oil (overlaying is 
not necessary if using a heated-Ud thermal cycler). 

7. Seal microplate with Thermaseal well tape. Immediately heat lysates at 75-80°C for 
20 15 minutes in a thermal cycler or oven to inactivate cellular nucleases. 

8. During the heating step, proceed with the reaction setup. See DETAILED mRNA 
INVADER ASSAY PROTOCOL (above) for instructions. 

9. After the heat inactivation step, add the lysate samples immediately to the reaction 
microplate. Alternatively, the lysate samples can be quickly transferred to a -TC'C 

25 freezer for later testing (long-term stability has not been estabhshed and may differ 

for each cell type). 
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IV. PROCEDURAL NOTES FOR OPERATION OF THE mRNA INVADER 
ASSAY 

1 . RNA sample types and optimization of RNA sample amount. 

The assay is optimized for performance with total RNA samples prepared 
from either tissue or cells. Several total RNA preparation methods/kits have been 
vaUdated for performance in the mRNA INVADER assay: 

• TRIZOL (Life Technologies, Rockville, MD) 

• RNeasy (Qiagen, Valencia, CA) 

• RNA WIZ (Ambion, Austin, TX) 

It is important to use a method or kit that minimizes the level of genomic 
DNA, which can inhibit signal generation. Performance of a preliminary 
experiment is recommended to determine the amount of total RNA sample 
(typically l-200ng, depending on the gene's expression level) that provides the 
best Hmit of detection and dynamic range. 

The assay has also been validated with lysate samples from a number of 
cell types. Recommended cell densities in a 96-well tissue culture microplate are 
10,000-40,000 cells per well depending on cell type and expression level of the 
gene of interest. Performance of a preliminary experiment is recommended for 
any given cell line and/or gene being monitored. Such an experiment should 
include different cell density levels and/or dilution of the lysate samples with IX 
Cell Lysis Buffer 1 (e.g. a 1)^1 test level is prepared by mixing Ijil lysate sample + 
4)Lil IX Cell Lysis Buffer 1 for a 5|il sample addition). 

2. Dynamic range modulation: variable secondary reaction incubation times. 

The length of the secondary reaction incubation time Usted in the protocol 
is sufficient for most analytes. However, the linear detection range 
(Signal/Background < 15-25) can be adjusted by reading the reaction microplate 
at variable times after addition of the secondary FRET reagents. For example. 
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high expression samples can often be detected in 15-30 minutes. The Direct 
Read method (DETAILED mRNA INVADER ASSAY PROTOCOL, step 7) 
enables simple optimization of the secondary reaction time as the reaction 
microplate can be incubated further if an early time read does not provide enough 
signal from the samples being tested. 

Monitoring the secondary reaction fluorescence signal with time can also extend 
the dynamic range of the assay. The Direct Read method at multiple time points 
can be apphed using low-cost instrumentation. Altematively, real-time 
fluorescence instrumentation can be used to achieve comparable dynamic ranges 
exhibited by other mRNA quantitation methods. 

Dynamic range modulation: variable sample levels. 

While the FRET detection method greatly simplifies the assay, the 
dynamic range is typically limited to 2-3 logs when using an endpoint read 
method. However, since mRNA INVADER assay signal is generated linearly 
with both target level and time, the easiest method for extending the dynamic 
range beyond 3 logs (as may be required, e.g., for highly induced genes) is to 
adjust total RNA sample levels. Fold changes in gene expression (treated sample 
signal divided by untreated sample signal) can be reliably calculated using 
normalized sample signals. This is accomplished by testing sample levels that 
give signal within the linear detection range defined by the standard curve. For 
example, the fold induction for a highly induced sample can be calculated as 
follows: 

Fold induction (Net Signal for 1 ng treated sample X 100) / Net Signal 
for 100 ng untreated sample 
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V. TROUBLESHOOTING GUIDE 



Problem 


Possible Solution 


No signal 


• Check that the fluorescence microplate reader has been set up 
correctly and that the appropriate excitation and emission filters are 
in place. 

• Perform mRNA INVADER assay with the provided standard as a 
positive control 

• Potential RNase contamination of the samples and reagents. 
Discard suspect reagents. 

• Use only reagents and oligonucleotides supplied in the kit. Do not 
mix reagents or oligonucleotides between kits. 


High variation 

between 

replicates 


• Always work with master primary and secondary reaction mixes. 

m T'Tior/Mi frill \/ mi Y stll milctpr TTiiYPQ JlTirl Ql^TtrnlP*? 

• i ilUiUU^Iliy XlilA all iliaotCi ililACo ollU. Da-liipito. 

• Pipet in a similar manner across all the controls and samples. 

• Calibrate pipets frequently. 


Lack of low 
target level 
detection 


• Calibrate thermal cycler or heat block. 

• Minimize assay variability (see above), i.e. CVs are less than 5% 
for the sample replicates. This is particularly important for 
detecting low target levels. 


Lack of 
discrimination 
between high 
signal samples 


• Decrease secondary reaction incubation time to achieve detection 
within the linear range of the assay. 

• Use less total RNA per reaction. 

• Attenuate cell lysate sample signal (see NOTE, Sample and Control 
Preparation, Part C). 


Signal 
inhibition 


• Run samples on an agarose gel to check for presence of genomic 
DNA. Alter the RNA sample isolation method to minimize 
genomic DNA or presence of other inhibitors. The same isolation 
procedure should be used throughout an experiment. 

• If using the cell lysate format, residual PBS can be inhibitory. Be 
sure to remove residual PBS from the tissue culture microplate. Do 
not use PBS that contains MgCh or CaCb , which inhibits the 
assay. 
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APPENDIX A: 
mRNA INVADER SINGLE ASSAY WORKSHEET 

mRNA INVADER ASSAY PROCEDURE 

■ Prepare samples and controls. 

5 ■ Prepare Primary Reaction Mix. Vortexbriefly and aliquot Sniper well 

■ Add 5|Lil sample or control per well and pipet up and down 1-2 times. 
« Add 1 0|il CHILLOUT or mineral oil per well. 

■ Incubate primary reaction at 60''C for 90 minutes. 

■ Prepare Secondary FRET Reaction Mix, vortex briefly. 

10 ■ Using a multichannel pipet, aliquot 5|Ltl per well and pipet up and down 1-2 times. 

■ Incubate secondary reaction at 60''C for 60 or 90 minutes. 

« Read microplate in fluorescence microplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm). 

15 PRIMARY REACTION MIX 



Reaction Components 


IX Volume 


X Volume (No. of 
reactions x 1 .25^ 


RNA Primary Buffer 1 


4.0ial 




Primary Oligos 


0.25 |xl 




Tioeo.i Buffer 


0.25^1 




CLEAVASEenzyme 


0.5ia1 




Total Mix Volume (IX) 


5.o^l 




SECONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


X Volume 
CNo. of reactions x 1 .25) 


RNA Secondary Buffer 1 


l.Oyd 




Secondary Oligos 


1.5|al 




Tioeo.i Buffer 


1.5|al 




Total Mix Volume (IX) 


5.0^1 
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APPENDIX B: 
mRNA INVADER BIPLEX ASSAY WORKSHEET 

thRNA invader ASSAY PROCEDURE 

■ Prepare samples and controls. 

■ Prepare Primary Reaction Mix. Vortex briefly and aliquot 5[il per well 

■ Add 5p,l sample or control per well and pipet up and down 1-2 times, 

■ Add lOiiil CHILLOUT or mineral oil per well. 

■ Incubate primary reaction at 60°C for 90 minutes, 

■ Prepare Secondary FRET Reaction Mix, vortex briefly. 

■ Using a multichannel pipet, aliquot 5|al per well and pipet up and down 1-2 times. 

■ Incubate secondary reaction at 60°C for 60 or 90 minutes. 

■ Read microplate in fluorescence microplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm and red dye: Ex. 560nm/Em. 620nm). 



PRIMARY REACTION MIX 



Reaction Components 


IX Volume 


X Volume 
(# reactions x 1 .25) 


RNA Primary Buffer 1 


4.0^il 




Primary Oligos 


0.25nl 




Housekeeping Primary Oligos 


0.25^1 




CLEAVASE IX enzyme 


0.5^1 




Total Mix Volume (IX) 


S.Ojxl 




SECONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


X Volume 
(# reactions x ^.25^ 


RNA Secondary Buffer 1 


2.0jxl 




Secondary Oligos 


1.5|xl 




Housekeeping Secondary Oligos 


1.5nl 




Total Mix Volume (IX) 


S.O^iI 





21% 



TABLE 2 



5 



Klenow 


Kcatfs' 


KmfdNTP) 


Kd(nM) 


Relative DiN A 


Reference 


i aq r 01 




a 






affinity 






Wild-Type 


2.4 


2.8 


8 


1 


2 


WiiQ-iype 


S610A 


n. d. 


~ 


n. d. 




c 

J 


oD 1 J 


R668A 


0.006 


6.5 


140, 150 


A r\£i A AC 


1 O 




N678A 


n. d. 


- 


n. d. 




c 

D 




E710A 


0.1 


15 


250 


0.03 


Z 




E710D 


1.7 


7.7 


110 


0.07 


2 


boi 


K75oA 


A 1 '2 1 
U.l J 1 


ID.O 






A 


K663 


K758R 


2.0 


2.1 




1.125 


4 


K663 


Y766S 


0.8 


6.4 


13 


0.4, 0.6 


1,2 


Y671 


R841A 


0.3 


9.8 


40,53 


0.2 


1,2 


R746* 


N845A 


1.0 


23 


8,5 


1.0, 1.7 


1,2 


N750* 


N845Q 


0.03 


1.7 


80, 55 


0.1,0.2 


1,2 


N750* 


Q849A 


0.02 


3.8 


100, 160 


0.08, 0.05 


1,2 


Q754 


Q849E 


0.001 


n. d. 


90, 91 


0.09 


1,2 


Q754 


H881A 


0.3 


3.3 


20, 28 


0.4, 0.3 


1,2 


H784* 


D882N 


<0.0001 


n. d. 


30 


0.6 


2 


D785 


D882S 


0.001 


7.5 


0.9 


9 


2 


D785 



References: 

1. JBC (1990) 265:14579-14591 

2. JBC (1992) 267:8417-8428 

10 3. Eur. J. Biochem (1993) 214:59-65 

4. JBC (1994) 269:13259-13265 

5. Nature (1996) 382:278-281 



15 TABLE 3: Rational mutations in the polymerase region 
A. DNA activity table 



IdT %Tth %Taq4M 


[ HP X 


TthDNRXHT 


31.91 


100% 


83% 


3.81 


101.9 


TthDNRXHT 
H641A 


23.61 


74% 


62% 


5.32 


221.24 


TthDNRXHT 
R748A 


22.1 


69% 


58% 


4.39 


88.17 


TthDNRXHT 
H786A 


34.31 


108% 


90% 


7.75 


185.35 


TthDNRXHT 


32.1 


101% 


84% 


5.7 


332.8 
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H786A/G506K/ 
O509K (AKK) 












TaqDNRXHT 
W417L/G418K/ 
E507Q/H784A 
(Taq 4M) 


38.23 


120% 


100% 


68.21 


1100.18 


Taq 4M G504K 


36.04 


113% 


94% 


31.76 


417.40 


Taq4MH639A 


42.95 


135% 


112% 


91.46 


2249.67 


Taq 4M R587A 


44.78 


140% 


117% 


143.0 


252.69 


Taq DN RX HT 

W417L/G418K/ 

G499R/A502K/ 

I503L/K504N/ 

E07K/H784A 

(TaqSM) 


43.95 


138% 


115% 


122.53 


346.56 


TaqSS R677A 


32.3 


101% 


84% 


206.9 


2450.0 



B. RNA activity table 





TrTl 


%Tth %Taq4M 


Tth DN RX HT 


0.89 


100% 


34% 










Tth DN RX HT 


1.18 


133% 


45% 


H641A 








TthDNRXHT 


1.34 


151% 


51% 


R748A 








TthDNRXHT 


1.31 


147% 


49% 


H786A 








TthDNRXHT 


1.59 


179% 


60% 


H786A/G506K/ 








O509K(AKK) 








TaqDNRXHT 


2.65 


298% 


100% 


W417L/G418K/ 








E507Q/H784A 
(Taq 4M) 








Taq 4M G504K 


2.76 


310% 


114% 


Taq4MH639A 


3.89 


437% 


147% 


Taq4MR587A 


3.13 


352% 


118% 


TaqDNRXHT 


4.00 


450% 


151% 


W417L/G418K/ 








G499R/A502K/ 








I503L/K504N/ 








E07K/H784A 








(TaqSM) 








TaqSS R677A 


2.22 


249% 


84% 



5 
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TABLE 4: Rational arch mutations 



DNA activity table 



IdT %Tth %Taq4M HP X 



Taq4M 
P88E/P90E 


10.20 


32% 


27% 


2.00 


97.00 


Taq4M 
G80E 


26.30 


82% 


69% 


103.6 


2900 


Taq4M 
L109F/A110T 


36.45 


114% 


95% 


19.71 


749.69 
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RNA activity table 



IrTl %Tth %1 


raq4M 


Taq4M 
P88E/P90E 


Taq4M 
P88E/P90E 


0.10 


11% 


4% 


Taq4M 
G80E 


Taq4M 
G80E 


3.11 


349% 


117% 


Taq4M 
L109F/A110T 


Taq4M 
L109F/A110T 


2.45 


275% 


92% 



10 

TABLE 5: Arch/thumb combinations 



DNA activity table 



IdT %Tth %Taq4M HP X 



TaqW417L/ 

G418K/E507K/ 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503L/ 

G504K/E507K/ 

T514S (Taq SS) 


63.33 


198% 


166% 


177.05 


202.32 


Taq P88E/P90E/ 

W417L/G418K/ 

G499R/A502Ky 

I503L/G504K/ 

E507K/T514S/ 

H784A 


36.48 


114% 


95% 


9.44 


70.35 



15 
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RNA activity table 

IrTl %Tth %Taq4M 



Taq W417L/ 

G418K/E507K/ 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503L/ 

G504K/E507Ky 

T514S (Taq SS) 


3.16 


355% 


119% 


Taq P88E/P90E/ 

W417L/G418K/ 

G499R/A502K/ 

I503L/G504K/ 

E507K/T514S/ 

H784A 


0.22 


25% 


8% 



TABLE 6: Helix-hairpin-helix random mutagenesis 
DNA activity table 



IdT %Tth %Taq4M HP X 



TaqSS 
K198N 


23.4 


73% 


61% 


25.7 


1233.1 


TaqSS 
A205O 


25.6 


80% 


67% 


13.4 


699.1 


TaqSS 
T204P 


11.2 


35% 


29% 


1.9 


209.4 


TaqSS 

I200M/A205G 


16.8 


53% 


44% 


7.8 


597.2 


TaqSS 
K203N 


25.9 


81% 


68% 


36.6 


1429.8 


TthDNRXHT 
H786A/P197R/K200R 


10.7 


33% 


28% 


3.2 


66.3 


TthDNRXHT 
H786A/K205Y 


11.5 


36% 


30% 


6.1 


327.5 


TthDNRXHT 
H786A/G203R 


18.3 


57% 


48% 


2.1 


98.8 



10 



15 
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RNA activity table 



IrTl %Tth %Taq4M 


TaqSS 
K198N 




1 J / /o 


40/0 


TaqSS 
A205Q 


A /TO 

0.62 


7U% 




TaqSS 
T204P 


U.io 


/I no/ 


1470 


TaqSS 

I200M/A205G 


0.77 


87% 


29% 


TaqSS 
BL203N 


2.09 


235% 


79% 


TthDNRXHT 
H786A/P197R/K200R 


0.47 


52% 


18% 


TthDNRXHT 
H786A/K205Y 


0.68 


77% 


26% 


TthDNRXHT 
H786A/G203R 


1.61 


180% 


61% 



TABLE 7: Random thumb mutations 
DNA activity table 

IdT %Tth %Taq4M HP X 



TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/K504N/(M1-13) 


59.96 


188% 


157% 


133.65 


907.41 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500I/Q507H 
A502K/G504K(Ml-36) 


46.74 


146% 


122% 


123.11 


822.61 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/T514S (M2-24) 


85.7 


269% 


224% 


369.96 


3752.12 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507Ky 
H784AA^518L (M2-06) 


76.7 


240% 


201% 


355.87 


2038.19 
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RNA activity table 

IrTl %Tth %Taq4M 



TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/K504N/(M1-13) 


2.55 


287% 


96% 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500I/Q507H 
A502K/G504K(Ml-36) 


2.71 


304% 


102% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AT514S (M2-24) 


4.43 


498% 


167% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AA^518L(M2-06) 


3.56 


400% 


134% 



TABLES: Chimeric mutants 
A. DNA activity table 





IdT2 


%TthAKK 


HP 


X 


TthAKK 


34.18 


100% 


5 


393 


Taq 4M G504K 


40.19 


105% 


28 


1991 


TfiDN2M 


36.60 


106% 


289 


1326 


Tsc DN 2M 


25.49 


75% 


283 


2573 


TaqTthAKK 


63.89 


187% 


32 


1658 


TthTaq 4M G504K 


25.03 


73% 


8 


627 


TfiTthAKK 


34.13 


100% 


15 


459 


TscTthAKK 


35.23 


103% 


29 


2703 


TfiTaq 4M G504K 


35.69 


104% 


37 


872 


TscTaq 4M 
G504K 


30.04 


88% 


25 


2008 



B. RNA activity table 



IrT3 %TthAKK 



TthAKK 


2.27 


100% 


Taq 4M G504K 


2.31 


102% 


TfiDN2M 


0.20 


9% 


Tsc DN 2M 


0.29 


13% 
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TaqTthAKK 


6.81 


300% 


TthTaq 4M 
G504K 


1.09 


48% 


TfiTthAKK 


1.24 


55% 


TscTthAKK 


9.65 


4.25% 


TfiTaq 4M G504K 


1.05 


46% 


TscTaq 4M 
G504K 


2.95 


130% 



All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described 

5 methods and systems of the invention will be apparent to those skilled in the art without 
departing fi^om the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be xmduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention 

10 which are obvious to those skilled in the relevant fields are intended to be within the 
scope of the following claims. 
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SEQUENCE LISTING 
<110> Third Wave Technologies 
<120> Detection of RNA 



<13 0> FORS 
<160> 708 

<170> Patentin version 3.0 

<210> 1 
<211> 834 
<212> PRT 

<213> Thermus aquaticus 
<400> 1 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 



Glu Val Pro Gly 
115 

Lys Ala Glu Lys 
130 

Asp Leu Tyr Gin 
14 5 

Gly His Leu lie 



Pro Glu Gin Trp 
180 

Asn Leu Pro Gly 
195 



Tyr Glu Ala Asp 
120 

Glu Gly Tyr Glu 
135 

Leu Val Ser Asp 
150 

Thr Pro Glu Trp 
165 

Val Asp Phe Arg 



Val Lys Gly He 
200 



Asp Val Leu Ala 



Val Arg He Leu 
140 

Arg Val Ala Val 
155 

Leu Trp Glu Lys 
170 

Ala Leu Val Gly 
185 

Gly Glu Lys Thr 



Thr Leu Ala Lys 
12 5 

Thr Ala Asp Arg 



Leu His Pro Glu 
160 

Tyr Gly Leu Arg 
175 

Asp Pro Ser Asp 
190 

Ala Leu Lys Leu 
205 



Z86 



Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 
500 505 510 
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Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 
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Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 



Lys Gly 



<210> 2 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 2 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Ly.s Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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515 



520 



525 



He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 



291 



820 



825 



830 



Gly His His His His His His 
835 

<210> 3 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 3 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 
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Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 



Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 
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Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 
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Ser Ala Lys Glu His His His His His His 
835 840 



<210> 4 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 4 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 
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Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 
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<210> 5 

<211> 839 

<212> PRT 

<213> Artificia 

<220> 

<223> Synthetic 

<400> 5 

Met Asn Ser Gly 
1 

Leu Val Asp Gly 
20 

Gly Leu Thr Thr 
35 

Ala Lys Ser Leu 
50 

Val Val Phe Asp 
65 

Gly Tyr Lys Ala 



Leu Ala Leu lie 
100 

Glu Val Pro Gly 
115 

Lys Ala Glu Lys 
130 

Asp Leu Tyr Gin 
145 

Gly Tyr Leu lie 



Pro Asp Gin Trp 
180 

Asn Leu Pro Gly 
195 

Leu Glu Glu Trp 
210 

Leu Lys Pro Ala 
225 

Lys Leu Ser Trp 



Met Leu Pro Leu 
5 

His His Leu Ala 



Ser Arg Gly Glu 
40 

Leu Lys Ala Leu 
55 

Ala Lys Ala Pro 
70 

Gly Arg Ala Pro 
85 

Lys Glu Leu Val 



Tyr Glu Ala Asp 
120 

Glu Gly Tyr Glu 
135 

Leu Leu Ser Asp 
150 

Thr Pro Ala Trp 
165 

Ala Asp Tyr Arg 



Val Lys Gly lie 
200 

Gly Ser Leu Glu 
215 

lie Arg Glu Lys 
230 

Asp Leu Ala Lys 



Phe Glu Pro Lys 
10 

Tyr Arg Thr Phe 
25 

Pro Val Gin Ala 



Lys Glu Asp Gly 
60 

Ser Phe Arg His 
75 

Thr Pro Glu Asp 
90 

Asp Leu Leu Gly 
105 

Asp Val Leu Ala 



Val Arg lie Leu 
140 

Arg lie His Val 
155 

Leu Trp Glu Lys 
170 

Ala Leu Thr Gly 
185 

Gly Glu Lys Thr 



Ala Leu Leu Lys 
220 

lie Leu Ala His 
235 

Val Arg Thr Asp 



Gly Arg Val Leu 
15 

His Ala Leu Lys 
30 

Val Tyr Gly Phe 
45 

Asp Ala Val He 



Glu Ala Tyr Gly 
80 

Phe Pro Arg Gin 
95 

Leu Ala Arg Leu 
110 

Ser Leu Ala Lys 
125 

Thr Ala Asp Lys 



Leu His Pro Glu 
160 

Tyr Gly Leu Arg 
175 

Asp Glu Ser Asp 
190 

Ala Arg Lys Leu 
205 

Asn Leu Asp Arg 



Met Asp Asp Leu 
240 

Leu Pro Leu Glu 



Z98 



245 



250 



255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
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545 



550 



555 



560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 6 



300 



<211> 839 
<212> PRT 
<213> Artificial 



<220> 

<223> Synthetic 
<400> 6 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 
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Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 



835 



<210> 
<211> 



7 

839 
PRT 



<212> 
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<213> Artificial 



<220> 

<223> Synthetic 
<400> 7 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 
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Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 



Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 



305 



Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 



Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 



835 



<210> 
<211> 
<212> 
<213> 



8 

839 
PRT 

Artificial 
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<220> 

<223> Synthetic 
<400> 8 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
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275 



280 



285 



Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
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580 



585 



590 



Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 9 
<211> 839 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
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<400> 9 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 
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Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 



Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 10 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 10 
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Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 



313 



Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp 
595 600 605 
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Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 



His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp lie His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 



<210> 


11 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


11 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
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X 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
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305 



310 315 320 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
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610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 . 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 12 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 12 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 



318 



Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 
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Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 



Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu ,Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 



Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly 



<210> 13 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 13 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 
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Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 



Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 
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Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 



Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu 



<210> 


14 


<211> 


833 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


14 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
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35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
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340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
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645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly 



<210> 15 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 15 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 



327 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 
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Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 
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Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu 



<210> 16 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 16 

Met Asn Ser Gly Met 
1 5 

Leu Val Asp Gly His 
20 

Gly Leu Thr Thr Ser 
35 

Ala Lys Ser Leu Leu 
50 



Leu Pro Leu Phe Glu Pro 
10 

His Leu Ala Tyr Arg Thr 
25 

Arg Gly Glu Pro Val Gin 
40 

Lys Ala Leu Lys Glu Asp 
55 



Lys Gly Arg Val Leu 
15 

Phe His Ala Leu Lys 
30 

Ala Val Tyr Gly Phe 
45 

Gly Asp Ala Val He 
60 
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Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 
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Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly His Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 



332 



Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 





835 


<210> 
<211> 
<212> 
<213> 


17 

839 

PRT 

Artificial 


<220> 
<223> 


Synthetic 


<400> 


17 


Met Asn Ser Gly Met 
1 5 


Leu Val Asp Gly His 
20 



10 15 



25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
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65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 55 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
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370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
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675 



680 



685 



Glu Ala Gin Ala 
690 

Arg Ala Trp lie 
705 

Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

His Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Phe lie Glu Arg 
695 

Glu Lys Thr Leu 
710 

Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Tyr Phe Gin Ser 
700 

Glu Glu Gly Arg 
715 

Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Phe Pro Lys Val 



Arg Arg Gly Tyr 
720 

Asp Leu Glu Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 18 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 18 

Met Asn Ser Gly Met Leu Pro Leu 
1 5 

Leu Val Asp Gly His His Leu Ala 
20 

Gly Leu Thr Thr Ser Arg Gly Glu 
35 40 

Ala Lys Ser Leu Leu Lys Ala Leu 
50 55 

Val Val Phe Asp Ala Lys Ala Pro 
65 70 



Phe Glu Pro Lys Gly Arg Val Leu 
10 15 

Tyr Arg Thr Phe His Ala Leu Lys 
25 30 

Pro Val Gin Ala Val Tyr Gly Phe 
45 

Lys Glu Asp Gly Asp Ala Val lie 
60 

Ser Phe Arg His Glu Ala Tyr Gly 
75 80 
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Gly Tyr Lys Ala Gly 
85 

Leu Ala Leu lie Lys 
100 

Glu Val Pro Gly Tyr 
115 

Lys Ala Glu Lys Glu 
130 

Asp Leu Tyr Gin Leu 
145 

Gly Tyr Leu lie Thr 
165 

Pro Asp Gin Trp Ala 
180 

Asn Leu Pro Gly Val 
195 

Leu Glu Glu Trp Gly 
210 

Leu Lys Pro Ala lie 
225 

Lys Leu Ser Trp Asp 
245 

Val Asp Phe Ala Lys 
260 

Phe Leu Glu Arg Leu 
275 

Leu Glu Ser Pro Lys 
290 

Gly Ala Phe Val Gly 
305 

Asp Leu Leu Ala Leu 
325 

Pro Glu Pro Tyr Lys 
340 

Leu Ala Lys Asp Leu 
355 

Pro Pro Gly Asp Asp 
370 



Arg Ala Pro Thr Pro Glu 
90 

Glu Leu Val Asp Leu Leu 
105 

Glu Ala Asp Asp Val Leu 
120 

Gly Tyr Glu Val Arg lie 
135 

Leu Ser Asp Arg lie His 
150 155 

Pro Ala Trp Leu Trp Glu 
170 

Asp Tyr Arg Ala Leu Thr 
185 

Lys Gly lie Gly Glu Lys 
200 

Ser Leu Glu Ala Leu Leu 
215 

Arg Glu Lys lie Leu Ala 
230 235 

Leu Ala Lys Val Arg Thr 
250 

Arg Arg Glu Pro Asp Arg 
265 

Glu Phe Gly Ser Leu Leu 
280 

Ala Leu Glu Glu Ala Pro 
295 

Phe Val Leu Ser Arg Lys 
310 315 

Ala Ala Ala Arg Gly Gly 
330 

Ala Leu Arg Asp Leu Lys 
345 

Ser Val Leu Ala Leu Arg 
360 

Pro Met Leu Leu Ala Tyr 
375 



Asp Phe Pro Arg Gin 
95 

Gly Leu Ala Arg Leu 
110 

Ala Ser Leu Ala Lys 
125 

Leu Thr Ala Asp Lys 
140 

Val Leu His Pro Glu 
160 

Lys Tyr Gly Leu Arg 
175 

Gly Asp Glu Ser Asp 
190 

Thr Ala Arg Lys Leu 
205 

Lys Asn Leu Asp Arg 
220 

His Met Asp Asp Leu 
240 

Asp Leu Pro Leu Glu 
255 

Glu Arg Leu Arg Ala 
270 

His Glu Phe Gly Leu 
285 

Trp Pro Pro Pro Glu 
300 

Glu Pro Met Trp Ala 
320 

Arg Val His Arg Ala 
335 

Glu Ala Arg Gly Leu 
350 

Glu Gly Leu Gly Leu 
365 

Leu Leu Asp Pro Ser 
380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



Leu Leu Lys Arg Leu 
420 

Glu Val Glu Arg Pro 
435 

Gly Val Arg Arg Asp 
450 

Ala Glu Glu He Arg 
465 

His Pro Phe Asn Leu 
485 



Glu Gly Glu Glu Arg Leu 
425 

Leu Ser Ala Val Leu Ala 
440 

Val Ala Tyr Leu Gin Ala 
455 

Arg Leu Glu Glu Glu Val 
470 475 

Asn Ser Arg Asp Gin Leu 
490 



Leu Trp Leu Tyr Arg 
430 

His Met Glu Ala Thr 
445 

Leu Ser Leu Glu Leu 
460 

Phe Arg Leu Ala Gly 
480 

Glu Arg Val Leu Phe 
495 



Asp Glu Leu Arg 
500 

Arg Ser Thr Ser 
515 



Leu Pro Ala Leu 

Ala Ala Val Leu 
520 



Gly Lys Thr Gin 
505 

Glu Ala Leu Arg 



Lys Thr Gly Lys 
510 

Glu Ala His Pro 
525 



He Val Glu Lys 
530 

Thr Tyr Val Asp 
545 

Leu His Thr Arg 



Ser Ser Asp Pro 
580 

Gin Arg He Arg 
595 

Ala Leu Asp Tyr 
610 

Gly Asp Glu Asn 
625 

Thr Glu Thr Ala 



He Leu Gin His 
535 

Pro Leu Pro Ser 
550 

Phe Asn Gin Thr 
565 

Asn Leu Gin Asn 



Arg Ala Phe He 
600 

Ser Gin He Glu 
615 

Leu He Arg Val 
630 

Ser Trp Met Phe 
645 



Arg Glu Leu Thr 
540 

Leu Val His Pro 
555 

Ala Thr Ala Thr 
570 

He Pro Val Arg 
585 

Ala Glu Glu Gly 



Leu Arg Val Leu 
620 

Phe Gin Glu Gly 
635 

Gly Val Pro Arg 
650 



Lys Leu Lys Asn 



Arg Thr Gly Arg 
560 

Gly Arg Leu Ser 
575 

Thr Pro Leu Gly 
590 

Trp Leu Leu Val 
605 

Ala His Leu Ser 



Arg Asp He His 
640 

Glu Ala Val Asp 
655 



Pro Leu Met Arg Arg Ala Ala Lys 
660 

Gly Met Ser Ala His Arg Leu Ser 
675 680 



Thr He Asn Phe Gly Val Leu Tyr 
665 670 

Gin Glu Leu Ala He Pro Tyr Glu 
685 
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Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 19 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 19 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 
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Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
lis 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 
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Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 
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Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 20 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 20 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
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100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
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405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
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705 



710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 21 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 21 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 



Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Plie Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 
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Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 22 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 22 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 
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Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 
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Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 
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Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

His Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 

Glu His His His 
835 



Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



His His His 



Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



<210> 23 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 23 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 9S 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
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130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
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435 



440 



445 



Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
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740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 24 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 24 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
lis 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 
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Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 450 
455 460 

Ala Glu Glu He Ala Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

His Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 25 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 25 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 
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Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Leu Lys Pro Ala He Arg Glu Lys 
225 230 

Lys Leu Ser Trp Asp Leu Ala Lys 
245 

Val Asp Phe Ala Lys Arg Arg Glu 
260 



He Leu Ala His Met Asp Asp Leu 
235 240 

Val Arg Thr Asp Leu Pro Leu Glu 
250 255 

Pro Asp Arg Glu Arg Leu Arg Ala 
265 270 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 
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Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 
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Leu Phe Pro Arg Leu Gin Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 



His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 26 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 26 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
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165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
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465 



470 475 480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 VOO 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
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770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 27 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 27 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 



Lys Ala Glu Lys Glu Gly Tyr Glu 
130 135 

Asp Leu Tyr Gin Leu Leu Ser Asp 
145 150 

Gly Tyr Leu lie Thr Pro Ala Trp 
165 



Val Arg lie Leu Thr Ala Asp Lys 

140 

Arg lie His Val Leu His Pro Glu 
155 160 

Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 



Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 835 

<210> 28 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 28 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 €0 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Oly Lys 
500 505 510 



Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 
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Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 29 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 29 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 



369 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Piie Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 ' 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 
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Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 30 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 30 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 SO 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 

450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
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820 825 830 



Glu His His His His His His 
835 

<210> 31 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 31 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
IS 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 
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Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 
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He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



377 



Glu His His His His His His 
835 



<210> 32 

<211> 839 

<212> PRT 

.<213> Artificial 

<220> 

<223> Synthetic 
<400> 32 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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hys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 
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Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



380 



<210> 33 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 33 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
X 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 GO 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 
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Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 



Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 34 
<211> 842 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 34 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 



Leu Leu Val Asp 
20 

Lys Gly Leu Thr 
35 

Phe Ala Lys Ser 
50 

Val Phe Val Val 
65 

Tyr Glu Ala Tyr 



Arg Gin Leu Ala 
100 

Arg Leu Glu Val 
115 



Gly His His Leu 



Thr Ser Arg Gly 
40 

Leu Leu Lys Ala 
55 

Phe Asp Ala Lys 
70 

Lys Ala Gly Arg 
85 

Leu lie Lys Glu 



Pro Gly Tyr Glu 
120 



Ala Tyr Arg Thr 
25 

Glu Pro Val Gin 



Leu Lys Glu Asp 
60 

Ala Pro Ser Phe 
75 

Ala Pro Thr Pro 
90 

Leu Val Asp Leu 
105 

Ala Asp Asp Val 



Phe Phe Ala Leu 
30 

Ala Val Tyr Gly 
45 

Gly Tyr Lys Ala 



Arg His Glu Ala 
80 

Glu Asp Phe Pro 
95 

Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 



Ala Lys Lys Ala 
130 

Asp Arg Asp Leu 
145 

Pro Glu Gly His 



Leu Arg Pro Glu 
180 

Ser Asp Asn Leu 
195 

Lys Leu Leu Lys 
210 

Asp Arg Val Lys 
225 

Glu Asp Leu Arg 



Pro Leu Glu Val 



Glu Lys Glu Gly 
135 

Tyr Gin Leu Val 
150 

Leu lie Thr Pro 
165 

Gin Trp Val Asp 



Pro Gly Val Lys 
200 

Glu Trp Gly Ser 
215 

Pro Glu Asn Val 
230 

Leu Ser Leu Glu 
245 

Asp Leu Ala Gin 



Tyr Glu Val Arg 
140 

Ser Asp Arg Val 
155 

Glu Trp Leu Trp 
170 

Phe Arg Ala Leu 
185 

Gly He Gly Glu 



Leu Glu Asn Leu 
220 

Arg Glu Lys He 
235 

Leu Ser Arg Val 
250 

Gly Arg Glu Pro 



He Leu Thr Ala 



Ala Val Leu His 
160 

Glu Lys Tyr Gly 
175 

Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 

Leu Lys Asn Leu 



Lys Ala His Leu 
240 

Arg Thr Asp Leu 
255 

Asp Arg Glu Gly 
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260 



265 



270 



Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He Ala Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 35 

<211> 842 

<212> PRT 

<213> Artificial 
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<220> 

<223> Synthetic 



<400> 35 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 



388 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 



Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Ala Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 36 

<211> 842 

<212> PRT 

<213> Artificial 



<220> 
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<223> Synthetic 



<400> 36 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
1 S 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 37 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 37 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
2 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 30 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
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290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
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595 



600 



605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


38 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


38 



395 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 



Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 
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Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 



Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 
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Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 39 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 39 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 
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Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 
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Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 



Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 
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Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He Ala 
625 630 635 640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


40 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


40 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
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20 



25 



30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
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325 



330 



335 



Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Ala Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
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625 



630 



635 



640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


41 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


41 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 
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Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 



Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
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645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 42 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 42 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 



Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 
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Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 . 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 
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Pro Leu 


Met 


Arg 
660 


Arg 


Ala 


Ala 


Lys 


Thr He Asn 
665 


Phe 


Gly Val 
670 


Leu 


Tyr 


Gly Met 


Ser 
675 


Ala 


His 


Arg 


Leu 


Ser 
680 


Gin Glu Leu 


Ala 


He 
685 


Pro 


Tyr 


Glu 


Glu Ala 
690 


Gin 


Ala 


Phe 


He 


Glu 
695 


Arg 


Tyr Phe Gin 


Ser 
700 


Phe 


Pro 


Lys 


Val 


Arg Ala 
705 


Trp 


He 


Glu 


Lys 
710 


Thr 


Leu 


Glu Glu Gly Arg Arg Arg 
715 


Gly Tyr 
720 


Val Glu 


Thr 


Leu 


Phe 
725 


Gly Arg Arg Arg Tyr Val 
730 


Pro 


Asp 


Leu 


Glu 
735 


Ala 


Arg Val 


Lys 


Ser 
740 


Val 


Arg 


Glu 


Ala 


Ala Glu Arg 
745 


Met 


Ala 


Phe 
750 


Asn 


Met 


Pro Val 


Gin Gly 
755 


Thr 


Ala 


Ala 


Asp 


Leu Met Lys 


Leu 


Ala 
765 


Met 


Val 


Lys 


Leu Phe 
770 


Pro 


Arg 


Leu 


Glu 


Glu 

77 R 


Met 


Gly Ala Arg 


Met 
780 


Leu 


Leu 


Gin 


Val 


Ala Asn 
785 


Glu 


Leu 


Val 


Leu 
790 


LjIU 


i\±a 


ir^ro i-iys oj.u 
795 


Arg 


Ala 


Glu 


Ala 


Val 
800 


Ala Arg 


Leu 


Ala 


Lys 
805 


Glu 


Val 


Met 


Glu Gly Val 
810 


Tyr 


ir £ O 


Leu 


815 


V Ct J. 


Pro Leu 


Glu 


Val 
820 


Glu 


Val 


Gly 


He 


Gly Glu Asp 
825 


Trp 


Leu 


Ser 
830 


A±a 


Lys 


Glu His 


His 
835 


His 


His 


His 


His 
















<210> 
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43 

839 

PRT 

Artificial 




















<220> 
<223> 


Synthetic 




















<400> 


43 
























Met Asn 
1 


Ser 


Gly Met 
5 


Leu 


Pro 


Leu 


Phe Glu Pro 
10 


Lys 


Gly Arg 


Val 
15 


Leu 


Leu Val 


Asp 


Gly 
20 


His 


His 


Leu 


Ala 


Tyr Arg Thr 
25 


Phe 


His 


Ala 
30 


Leu 


Lys 


Gly Leu 


Thr 
35 


Thr 


Ser Arg Gly Glu 
40 


Pro Val Gin 


Ala 


Val 
45 


Tyr Gly 


Phe 


Ala Lys 
50 


Ser 


Leu 


Leu 


Lys 


Ala 
55 


Leu 


Lys Glu Asp Gly Asp 
60 


Ala 


Val 


He 



Val Val Phe Asp Ala hys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 
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Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 
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Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 



Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


44 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


44 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
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65 



70 



75 



80 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
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370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 



415 



675 

Glu Ala Gin Ala Phe 
690 

Arg Ala Trp lie Glu 
705 

Val Glu Thr Leu Phe 
725 



680 

lie Glu Arg Tyr Phe Gin 
695 

Lys Thr Leu Glu Glu Gly 
710 715 

Gly Arg Arg Arg Tyr Val 
730 



685 

Ser Phe Pro Lys Val 
700 

Arg Arg Arg Gly Tyr 
720 

Pro Asp Leu Glu Ala 
735 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 45 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<22 3> Synthetic 



<400> 45 



Met Asn Ser Gly 
1 

Leu Val Asp Gly 
20 

Gly Leu Thr Thr 
35 

Ala Lys Ser Leu 
50 

Val Val Phe Asp 
65 



Met Leu Pro Leu 
5 

His His Leu Ala 



Ser Arg Gly Glu 
40 

Leu Lys Ala Leu 
55 

Ala Lys Ala Pro 
70 



Phe Glu Pro Lys 
10 

Tyr Arg Thr Phe 
25 

Pro Val Gin Ala 



Lys Glu Asp Gly 
60 

Ser Phe Arg His 
75 



Gly Arg Val Leu 
15 

His Ala Leu Lys 
30 

Val Tyr Gly Phe 
45 

Asp Ala Val lie 



Glu Ala Tyr Gly 
80 



416 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 



417 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 



418 



Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


46 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


46 


Met Asn Ser Gly Met 


1 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 



419 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



420 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys He Asn Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 

660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



421 



Arg Ala Trp lie 
705 

Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Glu Lys Thr Leu 
710 

Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly He 



Glu Glu Gly Arg 
715 

Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Arg Arg Gly Tyr 
720 

Asp Leu Glu Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 47 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 47 

Met Asn Ser Gly 
1 

Leu Val Asp Gly 
20 

Gly Leu Thr Thr 
35 

Ala Lys Ser Leu 
50 

Val Val Phe Asp 
65 

Gly Tyr Lys Ala 



Leu Ala Leu lie 



Met Leu Pro Leu 
5 

His His Leu Ala 



Ser Arg Gly Glu 
40 

Leu Lys Ala Leu 
55 

Ala Lys Ala Pro 
70 

Gly Arg Ala Pro 
85 

Lys Glu Leu Val 



Phe Glu Pro Lys 
10 

Tyr Arg Thr Phe 
25 

Pro Val Gin Ala 



Lys Glu Asp Gly 
60 

Ser Phe Arg His 
75 

Thr Pro Glu Asp 
90 

Asp Leu Leu Gly 



Gly Arg Val Leu 
15 

His Ala Leu Lys 
30 

Val Tyr Gly Phe 
45 

Asp Ala Val He 



Glu Ala Tyr Gly 
80 

Phe Pro Arg Gin 
95 

Leu Ala Arg Leu 



422 



100 



105 



110 



Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 



423 



405 



410 



415 



Leu Leu Lys Arg Leu 
420 

Glu Val Glu Arg Pro 
435 

Gly Val Arg Leu Asp 
450 

Ala Glu Glu He Ala 
465 

His Pro Phe Asn Leu 
485 

Asp Glu Leu Arg He 
500 

Arg Ser Thr Ser Ala 
515 

He Val Glu Lys He 
530 

Thr Tyr He Asp Pro 
545 

Leu His Thr Arg Phe 
565 

Ser Ser Asp Pro Asn 
580 

Gin Arg He Arg Arg 
595 

Ala Leu Asp Tyr Ser 
610 

Gly Asp Glu Asn Leu 
625 

Thr Glu Thr Ala Ser 
645 

Pro Leu Met Arg Arg 
660 

Gly Met Ser Ala His 
675 

Glu Ala Gin Ala Phe 
690 

Arg Ala Trp He Glu 



Glu Gly Glu Glu Arg Leu 
425 

Leu Ser Ala Val Leu Ala 
440 

Val Ala Tyr Leu Arg Ala 
455 

Arg Leu Glu Ala Glu Val 
470 475 

Asn Ser Arg Asp Gin Leu 
490 

Pro Lys He Lys Lys Thr 
505 

Ala Val Leu Glu Ala Leu 
520 

Leu Gin Tyr Arg Glu Leu 
535 

Leu Pro Asp Leu He His 
550 555 

Asn Gin Thr Ala Thr Ala 
570 

Leu Gin Asn He Pro Val 
585 

Ala Phe He Ala Glu Glu 
600 

Gin He Glu Leu Arg Val 
615 

He Arg Val Phe Gin Glu 
630 635 

Trp Met Phe Gly Val Pro 
650 

Ala Ala Lys Thr He Asn 
665 

Arg Leu Ser Gin Glu Leu 
680 

He Glu Arg Tyr Phe Gin 
695 

Lys Thr Leu Glu Glu Gly 



Leu Trp Leu Tyr Arg 
430 

His Met Glu Ala Thr 
445 

Leu Ser Leu Glu Val 
460 

Phe Arg Leu Ala Gly 
480 

Glu Arg Val Leu Phe 
495 

His Lys Thr Gly Lys 
510 

Arg Glu Ala His Pro 
525 

Thr Lys Leu Lys Ser 
540 

Pro Arg Thr Gly Arg 
560 

Thr Gly Arg Leu Ser 
575 

Arg Thr Pro Leu Gly 
590 

Gly Trp Leu Leu Val 
605 

Leu Ala His Leu Ser 
620 

Gly Arg Asp He His 
640 

Arg Glu Ala Val Asp 
655 

Phe Gly Val Leu Tyr 
670 

Ala He Pro Tyr Glu 
685 

Ser Phe Pro Lys Val 
700 

Arg Arg Arg Gly Tyr 



424 



705 



710 



715 



720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


48 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


48 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 



425 



Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 



Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



426 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 



Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



427 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 49 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 49 

Met Asn Ser Gly 
1 

Leu Val Asp Gly 
20 

Gly Leu Thr Thr 
35 

Ala Lys Ser Leu 
50 

Val Val Phe Asp 
65 

Gly Tyr Lys Ala 



Leu Ala Leu lie 
100 

Glu Val Pro Gly 
115 



Met Leu Pro Leu 
5 

His His Leu Ala 



Ser Arg Gly Glu 
40 

Leu Lys Ala Leu 
55 

Ala Lys Ala Pro 
70 

Gly Arg Ala Pro 
85 

Lys Glu Leu Val 



Tyr Glu Ala Asp 
120 



Phe Glu Pro Lys 
10 

Tyr Arg Thr Phe 
25 

Pro Val Gin Ala 



Lys Glu Asp Gly 
60 

Ser Phe Arg His 
75 

Thr Pro Glu Asp 
90 

Asp Leu Leu Gly 
105 

Asp Val Leu Ala 



Gly Arg Val Leu 
15 

His Ala Leu Lys 
30 

Val Tyr Gly Phe 
45 

Asp Ala Val lie 



Glu Ala Tyr Gly 
80 

Phe Pro Arg Gin 
95 

Leu Ala Arg Leu 
110 

Ser Leu Ala Lys 
125 



428 



Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 
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Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Leu Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 
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Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly He 



Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 50 

<211> 842 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 50 



Met Asn Ser Glu 
1 

Leu Leu Val Asp 
20 

Lys Gly Leu Thr 
35 

Phe Ala Lys Ser 
SO 

Val Phe Val Val 
65 

Tyr Glu Ala Tyr 



Arg Gin Leu Ala 
100 

Arg Leu Glu Val 
115 

Ala Lys Lys Ala 



Ala Met Leu Pro 
5 

Gly His His Leu 



Thr Ser Arg Gly 
40 

Leu Leu Lys Ala 
55 

Phe Asp Ala Lys 
70 

Lys Ala Gly Arg 
85 

Leu He Lys Glu 



Pro Gly Tyr Glu 
120 

Glu Lys Glu Gly 



Leu Phe Glu Pro 
10 

Ala Tyr Arg Thr 
25 

Glu Pro Val Gin 



Leu Lys Glu Asp 
60 

Ala Pro Ser Phe 
75 

Ala Pro Thr Pro 
90 

Leu Val Asp Leu 
105 

Ala Asp Asp Val 



Tyr Glu Val Arg 



Lys Gly Arg Val 
15 

Phe Phe Ala Leu 
30 

Ala Val Tyr Gly 
45 

Gly Tyr Lys Ala 



Arg His Glu Ala 
80 

Glu Asp Phe Pro 
95 

Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 

He Leu Thr Ala 
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130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Arg Gly Val Arg Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
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435 



440 



445 



Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
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740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 51 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 51 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 
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Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 



Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Tyr Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 



Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 52 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 52 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 
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Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Arg Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Aap Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 
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Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 



439 



Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 53 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 53 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
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165 



170 



175 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
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465 



470 



475 



480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
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770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 54 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 54 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 



Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 



Asn Thr Thr Pro 
385 

Glu Glu Ala Gly 



Leu Leu Lys Arg 
420 

Glu Val Glu Arg 
435 



Glu Gly Val Ala 
390 

Glu Arg Ala Ala 
405 

Leu Glu Gly Glu 



Pro Leu Ser Ala 
440 



Arg Arg Tyr Gly 
395 

Leu Ser Glu Arg 
410 

Glu Arg Leu Leu 
425 

Val Leu Ala His 



Gly Glu Trp Thr 
400 

Leu Phe Ala Asn 
415 

Trp Leu Tyr Arg 
430 

Met Glu Ala Thr 
445 



Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 

465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 



Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 

740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 



445 



Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


55 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


55 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Asn Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 
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Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 56 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 56 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Gin Arg Lys Leu 
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195 



200 



205 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 



450 



500 



505 



510 



Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
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805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 57 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 57 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly Met Gly Glu Lys Thr Gly Arg Lys Leu 
195 200 205 



452 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 



453 



Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 
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Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 



<210> 


58 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


58 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Asn Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



455 



Leu Lys Pro Ala lie Arg Glu Lys lie hen Ala His Met Asp Asp Leu 
225 230 235 240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



456 



He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 V35 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 
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Glu His His His His His His 
835 



<210> 59 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 59 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Pro Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
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225 



230 235 240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 

450 455 460 

Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 

465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 
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Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



460 



<210> 60 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 60 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
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245 



250 



255 



Val Asp Phe Ala 
260 

Phe Leu Glu Arg 
275 

Leu Glu Ser Pro 
290 

Gly Ala Phe Val 
305 

Asp Leu Leu Ala 



Pro Glu Pro Tyr 
340 

Leu Ala Lys Asp 
355 



Lys Arg Arg Glu 



Leu Glu Phe Gly 
280 

Lys Ala Leu Glu 
295 

Gly Phe Val Leu 
310 

Leu Ala Ala Ala 
325 

Lys Ala Leu Arg 



Leu Ser Val Leu 
360 



Pro Asp Arg Glu 
265 

Ser Leu Leu His 



Glu Ala Pro Trp 
300 

Ser Arg Lys Glu 
315 

Arg Gly Gly Arg 
330 

Asp Leu Lys Glu 
345 

Ala Leu Arg Glu 



Arg Leu Arg Ala 
270 

Glu Phe Gly Leu 
285 

Pro Pro Pro Glu 



Pro Met Trp Ala 
320 

Val His Arg Ala 
335 

Ala Arg Gly Leu 
350 

Gly Leu Gly Leu 
365 



Pro Pro Gly Asp Asp Pro Met Leu 
370 375 

Asn Thr Thr Pro Glu Gly Val Ala 
385 390 

Glu Glu Ala Gly Glu Arg Ala Ala 
405 



Leu Ala Tyr Leu Leu Asp Pro Ser 
380 

Arg Arg Tyr Gly Gly Glu Trp Thr 
395 400 

Leu Ser Glu Arg Leu Phe Ala Asn 
410 415 



Leu Leu Lys Arg 
420 

Glu Val Glu Arg 
435 

Gly Val Arg Leu 
450 

Ala Glu Glu He 
465 

His Pro Phe Asn 



Asp Glu Leu Arg 
500 

Arg Ser Ser Ser 
515 



Leu Glu Gly Glu 



Pro Leu Ser Ala 
440 

Asp Val Ala Tyr 
455 

Ala Arg Leu Glu 
470 

Leu Asn Ser Arg 
485 

Leu Pro Lys Leu 



Ala Ala Val Leu 
520 



Glu Arg Leu Leu 
425 

Val Leu Ala His 



Leu Arg Ala Leu 
460 

Ala Glu Val Phe 
475 

Asp Gin Leu Glu 
490 

Lys Lys Thr Lys 
505 

Glu Ala Leu Arg 



Trp Leu Tyr Arg 
430 

Met Glu Ala Thr 
445 

Ser Leu Glu Val 



Arg Leu Ala Gly 
480 

Arg Val Leu Phe 
495 

Lys Thr Gly Lys 
510 

Glu Ala His Pro 
525 



He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
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545 



550 555 560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Ala Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 61 
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<211> 839 
<212> PRT 
<213> Artificial 



<220> 

<22 3> Synthetic 
<400> 61 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 
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Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 



835 



<210> 
<211> 
<212> 



62 

842 

PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 62 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 VOO 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 

<210> 63<211> 835 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 63 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 9S 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 
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Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 64 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 64 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 9S 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
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290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
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595 



600 605 



Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 65 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 65 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 
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Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 
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Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 ' 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 
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His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 66 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 66 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 
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Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 
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Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 



Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555' 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 
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Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 

<210> 67 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 67 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 
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Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 
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Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 



Leu Gly Leu Pro Pro Gly Asp Asp Pro I^et Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 



Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 

<210> 68 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 68 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
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50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 
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355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
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660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

His His His His His His 
835 

<210> 69 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 69 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



60 
120 
180 
240 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 



360 
420 
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cxci^ a. ^ L. L- V- ca. 


ccagct cctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 






c 1" era c 1" 1 1 ao 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccga.cta.cc 


gg y c c CL. y ci 




t~ rrcracaacc 

4^ \^ ^ ^ >^ 


ttcccggggt 


caagggcatc 


600 




cggcgaggaa 


gcttctggag 


y dy *-yyyyy CL 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


aacraaaatcc 


tggcccacat 


ggacgatctg 


720 




gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 




agcccgaccg ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


qcaaqqaqcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


agggg c u dg a. 


Uv- CC-y uy v-^*— 


^ C* v-^ ^ w 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


g*-yy *-y^yy^ 


actaCQQQQQ 


qqaqtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


at cggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


Iw* ^-1 ^—4 CtCA O- ^ ' ^ 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcct aCCt 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cqcqqtqctq 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


^ W t^^j ^ 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tT'par'PPaacr 

^ O.V — y 3 


aacQQQCcqc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


gg c u u a.y 1- ciy 


(~< Y r^rrpi r'r'r^c 
^^t^<^^yci*^'wVw-v-- 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


yyd Uv-^cyv^*— y 


cfOT'pt' 1" cat a 


1800 


gccgaggcgg 


9ttgggcgtt 


ggtggccctg 


gactatagcc 


dy ct t-ciy cty ^.^ u 


cr'ar'at" ret c 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 
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ttccccaagg 


tgcgggcctg gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 




gaagctcttc 


ccccgcctcc gggagatggg ggcccgcatg 


2340 


ctcctccagg 


tccacaacga 


gctcctcctg gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 


<210> 70 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 70 


cty y y ct i_ 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 




■h nrtc r* "t" ;^ rrrr 
L.yyv_^*^ u.civ_,\_-y 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtcca 


ccgggccccc 


1020 



489 



gagccttata 


aagccctcag 


ggacctgaag 


gaggcgcggg 


ggcttctcgc 


caaagacctg 


1080 


aQcqttctqq 


ccctgaggga 


aggccttggc 


ctcccgcccg 


gcgacgaccc 


catgctcctc 


1140 


gcctacctcc 


tqqacccttc 


gaacaccacc 


cccgaggggg 


tggcccggcg 


ctacggcggg 


1200 


aaatQCfacQQ 


aqqaqqcgqg 


ggagcgggcc 


gccctttccg 


agaggctctt 


cgccaacctg 


1260 


y z) y ri ^ 3 ^ 


ttqaqqqqqa 


ggagaggctc 


ctttggcttt 


accgggaggt 


ggagaggccc 


1320 


ctttccgctg 


tcctggccca 


tatggaggcc 


acgggggtgc 


gcctggacgt 


ggcctatctc 


1380 


agggccttgt 


ccctggaggt 


ggccgaggag 


atcgcccgcc 


tcgaggccga 


ggtcttccgc 


1440 


CtQQCCqqCC 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


cctctttgac 


1500 


aaQCtaqqqc 


ttcccgccat 


cggcaagacg 


gagaagaccg gcaagcgctc 


caccagcgcc 


1560 


gccgtcctgg 


aggccctccg 


cgaggcccac 


cccatcgtgg 


agaagatcct 


gcagtaccgg 


1620 


gagctcacca 


agctgaagag 


cacctacatt 


gaccccttgc 


cggacctcat 


ccaccccagg 


1680 


aCCTQQCCQCC 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacgggcag 


gctaagtagc 


1740 


tccgatccca 


acctccagaa 


catccccgtc 


cgcaccccgc 


ttgggcagag 


gatccgccgg 


1800 


gcctt catcg 


ccqaqqaqqq 


gtggctattg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 




cccacctctc 


cggcgacgag 


aacctgatcc 


gggtcttcca 


ggaggggcgg 


1920 


gacatccaca 


cggagaccgc 


cagctggatg 


ttcggcgtcc 


cccgggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


qqqcqqc caa 


gaccatcaac 


ttcggggtcc 


tctacggcat 


gtcggcccac 


2040 


cgcctctccc 


aqqaqctaqc 


catcccttac 


gaggaggccc 


aggccttcat 


tgagcgctac 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg 


attgagaaga 


ccctggagga 


gggcaggagg 


2160 


cgggggtacg 


tggagaccct 


cttcggccgc 


cgccgctacg 


tgccagacct 


agaggcccgg 


2220 


gtgaagagcg 


tgcgggaggc 


ggccgagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


ggctatggtg 


aagctcttcc 


ccaggctgga 


ggaaatgggg 


2340 


gccaggatgc 
gaggccgtgg 


tccttcaggt 
cccggctggc 


ccacaacgag 
caaggaggtc 


ctggtcctcg 
atggaggggg 


aggccccaaa 
tgtatcccct 


agagagggcg 
ggccgtgccc 


2400 
2460 


ctggaggtgg 


aggtggggat 


aggggaggac 


tggctctccg 


ccaaggagca 


ccaccaccac 


2520 


caccac 












2526 



<210> 71 

<211> 2517 

<212> DNA 

<213> Artificial 



490 



<220> 

<223> Synthetic 



<400> 71 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


aaacracQCCcr 


cccaccQcrcfC 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 



491 



gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cg^gg^gg*^^ 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 72 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 72 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



492 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 108 0 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 0 0 

g^ggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 14 40 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 168 0 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 174 0 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcgtg 180 0 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 192 0 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 



493 



gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 



2340 
2400 
2460 



gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 
gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 
ctcctccagg tccacaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 
gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 

<210> 73 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 73 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 



600 
660 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 



840 
900 



cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 



1020 



gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 



494 



gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


CCCCqaqqqq 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caCQQQQqtq 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


Qccqaqqaqq 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acqgaqaccq 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


J> r± 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 74 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 



495 



<400> 74 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 




cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 




cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 
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CLCCaCd-CCC 


yCLL-caacca 


y acgg ccacg 


gccacgggga 


y y C U Lciy Uciy 


t^LOt^ycH-O*.-^- 


1 7 A n 


aHcctgcsga. 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggcc t teat c 


1 Q n n 
X iJ u u 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


c^gggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


±y u 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


T Q Q n 
_L i? o U 


cgggcggcca 


agaccatcaa 


ct t cggggtc 


ct c tacggca 


tgt cggccca 


ccgcc ucucc 


o A A n 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


z lU U 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


Z Xb U 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


222 0 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


1 o Q n 
^ Z o U 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


agg^^s-tggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 


<210> 75 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 75 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 
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gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 

<210> 76 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 76 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
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ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 18 60 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2 22 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 234 0 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

9^99tgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 77 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 77 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
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ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

^tgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 108 0 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 
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tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcatcg 


ccgaggaggg 


gtggctattg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


agggtgctgg 


cccacctctc 


cggcgacgag 


aacctgatcc 


gggtcttcca 


ggaggggcgg 


1920 


gacatccaca 


cggagaccgc 


cagctggatg 


ttcggcgtcc 


cccgggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gaccatcaac 


ttcggggtcc 


tctacggcat 


gtcggcccac 


2040 


cgcctctccc 


aggagctagc 


catcccttac 


gaggaggccc 


aggccttcat 


tgagcgctac 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg 


attgagaaga 


ccctggagga 


gggcaggagg 


2160 


cgggggtacg 


tggagaccct 


cttcggccgc 


cgccgctacg 


tgccagacct 


agaggcccgg 


2220 


gtgaagagcg 


tgcgggaggc 


ggccgagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


ggctatggtg 


aagctcttcc 


ccaggctgga 


ggaaatgggg 


2340 


gccaggatgc 


tccttcaggt 


ccacaacgag 


ctggtcctcg 


aggccccaaa 


agagagggcg 


2400 


gaggccgtgg 


cccggctggc 


caaggaggtc 


atggaggggg 


tgtatcccct 


ggccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


aggggaggac 


tggctctccg 


ccaaggagca 


ccaccaccac 


2520 


caccac 












2526 


<210> 78 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 78 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aa-ggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 
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ggcatcgggg 


agaagaccgc 


cc t caagc t c 


cucaaggag u 




an^ :^ ^ ^ n 

^ Cl.Cl.ClCl%^ \^ L. 


660 




ctcaagaacc 




aaagccagaa 


aacguccggg 


agaagci uCo-d 




72 0 




gaagacctca 


ggctctcctt 


ggagctctcc 


cg99t9cgca 


ft "-a /~i /"I 

ccgacccuv-c 


t^v^ uy y cty y Ly 


780 




gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggc 1 1 a 


gggcc UECCU. 


ggagaggcug 


o *± u 




gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


Q n n 




gccccctggc 


ccccgccgga 


aggggccttc 


9t999Cttcg 


ccc Lcccccg 


ff f* f* f*f< !3 1^ /"> ft 

ccccy agcL-c- 


J? o u 




atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgt cca 


ccgggccccc 


1 n 9 n 

X U u 




gagccttata 


aagccctcag 


ggacctgaag 


gaggcgcggg 


ft/~r +" 4- /~*fff 

ggcL-uc ucgc 


Octdciy civ_v^ uy 


1 OR D 

X u o u 




sgcgttctgg 


ccctgaggga 


aggccttggc 


ctcccgcccg 


gcgacgaccc 


jT-f -a -I— f€ ft 4— /-* /-I 1^ 

caugcL-ocL-c 


1 1 4. 0 

X X U 




gcctacctcc 


tggacccttc 


gaacaccacc 


cc cgaggggg 


i^ggcccggcg 


CT-acyguggy 


12 0 0 




gagtggacgg 


aggaggcggg 


ggagcgggcc 


gccctttccg 


agaggct ctt 


ft ft ft ^ /** ^ 4— 

cgccaaccug 


xz o u 


y 


tgggggaggc 


ttgaggggga 


ggagaggctc 


ctttggcttt 


accgggaggt 


ggagaggccc 


1 o n 




ctttccgctg 


t cctggccca 


tatggaggcc 


acgggggtgc 


gccuggacgc 


frfTft f \~ ^ ^ ft 

ggccua L-C CO 


X o vj 




agggccttgt 


ccctggaggt 


ggccgaggag 


atcgcccgcc 


t cgaggccga 


ft ft +^ 4— ft ft ft ft 

ggc c u uccgc 


1 A d n 

Xrt *± W 


SB 


ctggccggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


ft ft ft 4~ 4~ 4-^ /— n 

cctic L. uugac 


1 n n 
XD u u 




gagctagggc 


ttcccgccat 


cggcaagacg 


gagaagaccg 


gcaagcgctc 


caccagcgcc 


XO O u 


fl 


gccgtcctgg 


aggccctccg 


cgaggcccac 


cccatcgtgg 


agaagatcct 


gcagtaccgg 


Xo ^ U 




gagctcacca 


agctgaagag 


cacctacatt 


gaccccttgc 


cggacctcat 


ccaccccagg 


XD O V 




acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacgggcag 


gcfcaagtagc 


T T A n 

X / 4t u 




tccgatccca 


acctccagaa 


catccccgtc 


cgcaccccgc 


ttgggcagag 


gatccgccgg 


-I Q A A 

X o U U 




gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


Xo D U 




cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


9gag999aag 


192 0 




gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 




ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2 04 0 




aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 




ttccaaagct 


tccccaaggt 


9cgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 




cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 




gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 



503 





gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 




gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 




gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 




ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 




caccac 












2526 




<210> 79 
<211> 2499 
<212> DNA 
<213> Artificial 














<220> 

<223> Synthetic 












5 


<400> 79 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


i 


caccac ctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


U I 


^ y y y ^ y y 


pcrcrl" pt 3 cQd 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 




y cn^y y y L-y d 


L-x^yL-yyu-k^cL. 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 




onrri" a r* j:^ ;^ rrrr 
yyy i-.ci.v_-ctcxyy 


rTtcjarTTiCirir' 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


hi 


ctciyyctyx^ i^yy 


i~ fTn 3 r** f"* i~ rr^ t" 
L.yyciov^ i^v^v^ L. 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 




gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 




accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 




gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 




gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 




ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 




aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 




aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 




aaaaggcggg 


agcccgaccg 


gg^gaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 




agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 




cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 




gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 




gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 




gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 



504 



ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cqqqcqqcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tccacgacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggt 






2499 



<210> 80 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 80 



505 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 



506 



aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2 2 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 234 0 

ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggag 24 99 

<210> 81 
<211> 2499 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 81 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 



600 



507 



ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 






gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 



508 



ctcctccagg 


tccacgacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggt 






2499 


<210> 82 
<211> 2499 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 82 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


99tggacggc 


D u 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


ion 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


ion 
low 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


gg act acggg 


z 4 U 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


T n n 
^ u u 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


o to u 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


A o r\ 
4 z U 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


A o n 
41 o U 


qqqtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


b4 U 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


60 0 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


iz £Z r\ 
D b U 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


/AO 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg ggagtggacg 


1200 



509 



gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 12 6 0 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 210 0 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggag 2499 



<210> 83 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 83 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 



510 



ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcc t cc tea 


dy y L. dd 


yy<=^yy^'"yyy 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


/~* \~ t~ 

gcccccuct^L- 


UCL^yL-v^d^— yd 


acr r* p t" a c Cf OCT 

yyL^(^ud.»wyy^ 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggacuc uc 


L-yyL-ddL-i- 


fcrrTT'tr'atc 

V— ^ Vji ^ ra. ^ 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


ccccyyyt-L.d 


^y*^yy ^yy 


360 


gacgtcctgg 


ccagcctggc 


caay aagg 


yddctctyyctyy 


cfpta pcraocft" 
y ^ 1.^0. \_. ^ y 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctccfct 


•)— /—I /-t/~f -a rt 
UCQ-.yclL.L-y*— ci 






480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 




c 0 a c c acr t Qcr 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


u L.L^ov-'yyyy 1— 


p a a crcrcf c a t c 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


y^L. y y ctcty 


ppt~ ppt caacT 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


uyyv^L^L 'dL^ciu. 


craa paafcctcr 


720 


aagctctcct 


gggacctggc 


^ d cty y L. y V— y ^ 


accgacctgc 


cccuy y dgy u 


yydv— L. L.v.^yL.^ 


780 


aaaaggcggg 


agcccgaccg 




agggcctttc 


uggagaggcc 


uy dy i_ L- uy y V— 


840 


agcctcctcc 


acgagttcgg 


C C L. U C C y y a.ci 


agccccaagg 


CCC uy y dyy a. 


yyt—L-L^L-Q-L-yy 


900 


cccccgccgg 


aaggggcctt 


cgtgggcfc tt 


gtgctttccc 


gcaaggay cc 


L-dcy '^yyy'— 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


3.aggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc gctacggcgg ggagtggacg 


1200 


gaggaggcgg 


ggcaccgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1 7 9 n 

X J Z W 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1 1 ft n 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


-1 tr n n 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


T cr n 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 



511 



gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 198 0 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 34 0 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 84 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 84 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 



600 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 



512 



aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc ccctggaggt ^ 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt agggcctttc tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgaguucgg 


ccttctggaa 


agccccaagg ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccc Liggccgc 


cgccaggggc ggccgcgtcc accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacc uy cici 


ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


/"< 4-' /"* /~< 

CCL-Cccguut- 


rrrrrrr?! naa PC CCatQCtCCt 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


c c c c gaggy g 




aqaqtqqacg 


1200 


gaggaggcgg 


ggg^gcgggc 


cgccctttcc 


gagaggctcc atcggaacct 


gtgggggagg 


1260 


cttgaggggg 


3.g9^9^9g^t 


cctttggctt 


taccgggagg tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaa.ga.c 


gcaaaagaca 


ggcaagcgct ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc tccagcaccg 


ggagctcacc 


_L O ^ U 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga ggatccgccg ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg ccgtggaccc 


cctgatgcgc 


_L O W 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca tgtcggccca 


ccgcctctcc 


9 n A n 

Z U rr VJ 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca ttgagcgcta 


ctttcagagc 


9100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg agggcaggag 


gcgggggtac 


9 1 (^0 


gtgga-ga.ccc 


tcttcggccg 


ccgccgctac 


gtgccagacc tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa aagagagggc 


ggaggccgtg 


2400 



513 



gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 



<210> 85 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 85 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



60 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 66 0 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 



514 



gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


13 80 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct tggggaagac gcaaaagaca ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1580 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga ggatccgccg ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2 040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


22 80 


ctcatgaagc 


tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 86 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 86 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 



515 



gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 



516 



acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2517 



<210> 87 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 87 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 



180 
240 
300 



480 
540 
600 



ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 



720 



aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
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aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgccgggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



518 



<210> 88 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 88 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


99tggacggc 


o U 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


12 0 


ccggtgcagg 


cggtctacgg 


cttcgccaag agcctcctca aggccctcaa 




1 Pin 

X o u 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


o A r\ 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc 


cccggcaact 


cgccctcatc 


"3 n n 
J u u 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcci-cgay y 


tcccgggc ta 


^9^99^99^*^ 


J D VJ 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaagy a.yy 


yL- '—cn^yci.yy i— 


ccgcatcctc 


A "9 n 
^ z u 


accgccgaca 


aagaccttta 


ccagctcctt 


uccy accy ca 


tTT'Pr'Crt'PPt" 
L- d y 


ccaccccgag 


^ o u 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


y uy dy y t-^ 


cgaccagtgg 


o4 U 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


L *-^t^^y y y y »- 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


y L-yy cLdy \_ 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


72 0 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


78 0 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


ccaggccttg 


1380 



519 



tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 89 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 89 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 



520 



aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 




J D W 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 




accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


fcccacgt cct 


ccaccccyo.y 


A ft n 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 




gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


D U U 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


C C U C C L- C a.ciy 


66 0 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


72 0 


aagctctcct 


ggga-cctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga ggccccctgg 


u u 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 




cpi t~cf1"crcfacc 


you 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 




ctycu—k-^v-'O 


T A O A 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 




oyocy L-t^ *-y 


T n Q ri 
± u o u 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


cca uy c ucc l. 




1 1 A C\ 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1 o o n 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1 o n 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


T "5 o n 
X U 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1 '5 0 n 

X J 0 u 


tccctggagc 


ttgccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


T A A n 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


T c n A 
Xo U U 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


"1 c n 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


162 0 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 
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cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg gattgagaag 


accctggagg 


agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


u dy dy y v_ y 




2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 


<210> 90 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 90 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct 


tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 
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cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


g^ggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatccgccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 91 



523 



<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 91 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


^cggggctaa 


agy acc L-Co-o. 


gg^gg^^^yy 


y y v_- v—- v_. \^ ^ 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggagg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 



524 



cttcccgcca 


tcggcaagac 


99°-g°^^y 




ccaccacrccrc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 




gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


a.ay c L.gCTa.y d. 




tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 




gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aa.cctcca.ga 


acatccccgt 


ccgcaccccg 


cttgggcaga ggatccgccg ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


a cggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 




agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


t tccccaagg 


tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


234 0 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 92 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 92 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 



300 



aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 



525 



gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaagg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 



526 



ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



2280 
2340 
2400 
2460 
2517 



240 
300 



<210> 93 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 93 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 



900 



cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 



527 



gatcttctgg ccctggccgc cgccaggggc 
gcggggctaa aggacctcaa ggaggtccgg 
gcctcgaggg aggggctaga cctcgtgccc 
ctggaccctt cgaacaccac ccccgagggg 
gaggacgccg cccaccgggc cctcctctcg 
ctcgaggggg aggagaagct cctttggctc 
gtcctggccc atatggaggc cacgggggtg 
tccctggagg tggccgagga gatcgcccgc 
caccccttca acctcaactc ccgggaccag 
Q cttcccgcca tcggcaagac gcaaaagacc 

gaggccctcc gcgaggccca ccccatcgtg 
aagctgaaga gcacctacat tgaccccttg 
ctccacaccc gcttcaacca gacggccacg 
pi aacctccaga acatccccgt ccgcaccccg 

O gccgaggagg ggtggctatt ggtggccctg 

gcccacctct ccggcgacga gaacctgatc 
J; acggagaccg ccagctggat gttcggcgtc 

¥^ cgggcggcca agaccatcaa cttcggggtc 

caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tccacaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 94 

<211> 2517 

<212> DNA 

<213> Artificial 



ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


ctcgaggccg aggtcttccg 


cctggccggc 


1440 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


gccacgggca ggctaagtag ctccgatccc 


1740 


cttgggcaga ggatccgccg ggccttcatc 


1800 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



528 



<220> 

<223> Synthetic 
<400> 94 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


gguggacyy o 


fin 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccgggggydy 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 






gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


o A n 
z ^ u 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 




aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


J D U 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg gctacgaggt 


ccgcatcctc 


^ z u 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


^ ^-f ^ 4" ^ 4*" 

cccacgcccu 


ccaccccgag 


A Q r\ 
4 o u 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcccgaggcc 


cgaccagL.gg 


_) *t \j 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


O U VJ 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


b o u 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


T o n 
/ z U 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


/ o U 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


O ^ A 

o4 U 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


102 0 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 



529 



<400> 95 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagcaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 18 60 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 



<210> 95 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



60 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



530 



gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagrgg 


R A n 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


n n 

D VJ U 


ggggagaaga 


cggcgaggaa gcttctggag gagtggggga gcctggaagc 


cccccucaag 


D D VJ 


aacctggacc 


ggc tgaagc c 


/"* ^ ^ ^ /Tj^ 

Cycoaucoyy 


Ct-CI.^ CI. 


tggcccacat 


ggacgatctg 




aagctctcct 


gggaccuggc 


octayy uy t^y v- 




ccctcJcraQQt 


ggacttcgcc 


7 ft n 


aaaaggcggg 


agcccgaccg 


yya.gayyccu 


CiyMM^^ ^ I— 


taaacfacrcrct 


tgagtttggc 


ft /in 
o ^ u 


agcctcctcc 


acgagc ucy y 


t" "h t" (Tn a a 
L. i_ouyycict 




ccctggagga 


ggccccctgg 


_? u u 


cccccgccgg 


aaggggcc u u 


cy LyyyoL-cu 




gcaaggagcc 


cacgcgggcc 


960 


gatcttctgg 


ccctggccgc 


cgc c ay ggy c 


ygc(^yuy uy(« 


a.v^ V* ^ ^ ^-"-^ '-^ 


agaccccttg 


1 n 9 n 
X u z u 


gcggggctaa 


aggacctcaa 




y y c \^ uv-y 


n ^5 ;^ acra r* c t 

\^ Cl O- ~H ~3 ^ ^ ^ 


cgccgtcttg 


1 n ft n 

X u o u 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccauy c u 


cgcctacctc 


X 14 U 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gc ua<^yyyyy 


ggagtggacg 


"1 o o n 
xz U U 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagagy c u V- 


a t" r^rirra ;^ r* f*^ 1~ 
ctL.L^yycicn-'^ v_ 


ccttaagcgc 


T O Pi 


ctcgaggggg 


aggagaagct 


cctttggctc 


uaccaogay y 


f~ rrrr;^ ;^ ;^ fT^"* r* 
t_yycicici.ciy\-.*w. 


cctctcccgg 


13 2 0 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgccuggacy 


•t- r\r% r' t" ;5 t" t" 
L-yyv--t^i— ctL-^L- 


cagggccttg 


T '3 ft n 
1 J o U 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


ay y c u^-uy 


cctggccggc 


A A C\ 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


t,CC-t.<— 1— L-L-yd. 


cgagctaggg 


1 c n A 
lb U U 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


c ua.v^(^ciy v^y V- 


cgccgtcctg 


Ibb U 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcaguaccy 


ggagctcacc 


loz U 


aagctgaaga 


acacctacat 


tgaccccttg 


ccggacctca 


cccaccccag 


gacgggccgc 




ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


174 0 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 


2160 



531 



2340 
2400 



gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 96 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 96 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 



1080 



532 





gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


CI iw» y w ^ > — ^ ^ 


cgcctacctc 


1140 




ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


rrr't" 3 r* CfCr Of CTCT 


aaacitcfQacQ 


1200 




gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc 


ciL-v-yyo-cik-.v-^*- 


ccttaacjCQC 


1260 




ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 




rr't* p1~ pccctct 


1320 




gtcctggccc 


atatggaggc 


rciciCicsot.0 


CQCCtQcracq 




p ;^ crcf a c c fc t cr 
^ o.y y y ^ ^ ^ ^ 


1380 




tccctggagg 


tggccgagga 




p t" caacrcjc ccf 


cty y L- V- L- u 1^ ^w^y 


rr t" crcr c c era c 


1440 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 




*-y ^-'■y ^ '-c-yyy 


1500 




cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


C C ci u dy v^y ^- 


pcrcr'crtcc ta 


1560 




gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcag uaccy 


y y dy o u. ^ ci ^ ^ 


1620 


5 


aagctgaaga 


gcacctacgt 


ggaccccttg 


ccggacctca 


L- V— ■ d. w "w ^y 


crs cacrcrccQC 


1680 


i 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggc uciciy Ldy 


r'tT'paat'CCC 

V^y CL ^ V— 


1740 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


yyd L-(^v— y^t— y 


ggcctt cafcc 


1800 




gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


aga udy dy u 


^'^'•y yy "-y ^ <-y 


1860 




gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggagggg*^y 


y y ct o. ^ ci w 


1920 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


c c g c g y d c_- <- L- 




1980 




cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


ug ucggccc-d 


r^rrrrr' P 1~ PP 
v^y "k-- V— • 1— v-rf 


2040 


ri 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


L-i-gagcgci-d 


^ U 1— ^cLy ciy 


2100 




ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 




gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 




ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg ggccaggatg 


2340 




ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 




gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 




gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 97 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



533 



<400> 97 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa gcttctggag gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccgagcctcg 


tccaccccag 


gacgggccgc 


1680 



534 



ctccacaccc gcttcaacca gacggccacg 
aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tccacaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 98 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 98 

atgaattcgg ggatgctgcc cctctttgag 
caccacctgg cctaccgcac cttccacgcc 
ccggtgcagg cggtctacgg cttcgccaag 
gacgcggtga tcgtggtctt tgacgccaag 
gggtacaagg cgggccgggc ccccacgccg 
aaggagctgg tggacctcct ggggctggcg 
gacgtcctgg ccagcctggc caagaaggcg 
accgccgaca aagaccttta ccagctcctt 
gggtacctca tcaccccggc ctggctttgg 



gccacgggca ggctaagtag ctccgatccc 
cttgggcaga ggatccgccg ggccttcatc 
gactatagcc agatagagct cagggtgctg 
cgggtcttcc aggaggggcg ggacatccac 
ccccgggagg ccgtggaccc cctgatgcgc 
ctctacggca tgtcggccca ccgcctctcc 
caggccttca ttgagcgcta ctttcagagc 
accctggagg agggcaggag gcgggggtac 
gtgccagacc tagaggcccg ggtgaagagc 
aacatgcccg tccagggcac cgccgccgac 
cccaggctgg aggaaatggg ggccaggatg 
gaggccccaa aagagagggc ggaggccgtg 
gtgtatcccc tggccgtgcc cctggaggtg 
gccaaggagc accaccacca ccaccac 



cccaagggcc gggtcctcct ggtggacggc 
ctgaagggcc tcaccaccag ccggggggag 
agcctcctca aggccctcaa ggaggacggg 
gccccctcct tccgccacga ggcctacggg 
gaggactttc cccggcaact cgccctcatc 
cgcctcgagg tcccgggcta cgaggcggac 
gaaaaggagg gctacgaggt ccgcatcctc 
tccgaccgca tccacgtcct ccaccccgag 
gaaaagtacg gcctgaggcc cgaccagtgg 
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gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


ggg^gcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


qqatccgccq 


ggccttcatc 


1800 


qccqaqqaqq 


qqtqqctatt 


ggtggccctg 


gactatagcc 


agatagagct 


caqqqtqctq 


1860 


gcccacctct 


ccqqcqacqa 


gaacctgatc 


cqqqtcttcc 


aqqaqqqqcq 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcqgqgtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 

<210> 99 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 99 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 

537 



ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttgggagg 12 60 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



<210> 100 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 100 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 



1320 
1380 
1440 
1500 
1560 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



2517 



538 



caccacctgg 


1 4— -a /-» r~< r~f 3 ^ 
CCLa.C^L.yOcLt- 




ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggucuacgg 


i_ L-Oy^v— ciciy 


agcctcctca 


aggccctcaa 


ggagg^cggg 


180 


gacgcgguga 




i"aacaccaaQ 


gccccctcct 


tccgccacga 


ggcctacggg 


240 




r* f7 n rrr* cr Q CT P 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


■1— r^r~\ —1 f- /-I /"> i" 

tgga.cc ucc c 


yyyy'^'-yy'-y 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagccugy c 


Cctciycici.yyv,^y 


gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


a.a.ga.CC L U L-ci 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


4"^ « *^ ^ /~1 /^/"T /"^ 

ucaccccyy c 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


ygycccL-ycu- 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


y y y y dy ciciy a. 


cggcgaggaa 


gcttctggag gagtggggga gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


ggga-cctggc 


caaggtgcgc 


accgacctgc 


ccctqqaqqt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


f~* ^ (~f ^^/~ifi 

cccccgccyy 




cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccct ca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtggaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 
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aacctccaga acatccccgt ccgcaccccg 
gccgaggagg ggtggctatt ggtggccctg 
gcccacctct ccggcgacga gaacctgatc 
acggagaccg ccagctggat gttcggcgtc 
cgggcggcca agaccatcaa cttcggggtc 
caggagctag ccatccctta cgaggaggcc 
ttccccaagg tgcgggcctg gattgagaag 
gtggagaccc tcttcggccg ccgccgctac 
gtgcgggagg cggccgagcg catggccttc 
ctcatgaagc tggctatggt gaagctcttc 
ctccttcagg tccacaacga gctggtcctc 
gcccggctgg ccaaggaggt catggagggg 
gaggtgggga taggggagga ctggctctcc 

<210> 101 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 101 

atgaattccg aggcgatgct tccgctcttt 
ggccaccacc tggcctaccg caccttcttc 
gaaccggtgc aggcggtcta cggcttcgcc 
gggtacaagg ccgtcttcgt ggtctttgac 
tacgaggcct acaaggcggg gagggccccg 
ctcatcaagg agctggtgga cctcctgggg 
gcggacgacg ttctcgccac cctggccaag 
atcctcaccg ccgaccgcga cctctaccaa 
cccgagggcc acctcatcac cccggagtgg 
cagtgggtgg acttccgcgc cctcgtgggg 
ggcatcgggg agaagaccgc cctcaagctc 



cttgggcaga ggatccgccg ggccttcatc 
gactatagcc agatagagct cagggtgctg 
cgggtcttcc aggaggggcg ggacatccac 
ccccgggagg ccgtggaccc cctgatgcgc 
ctctacggca tgtcggccca ccgcctctcc 
caggccttca ttgagcgcta ctttcagagc 
accctggagg agggcaggag gcgggggtac 
gtgccagacc tagaggcccg ggtgaagagc 
aacatgcccg tccagggcac cgccgccgac 
cccaggctgg aggaaatggg ggccaggatg 
gaggccccaa aagagagggc ggaggccgtg 
gtgtatcccc tggccgtgcc cctggaggtg 
gccaaggagc accaccacca ccaccac 



gaacccaaag gccgggtcct cctggtggac 
gccctgaagg gcctcaccac gagccggggc 
aagagcctcc tcaaggccct gaaggaggac 
gccaaggccc cctccttccg ccacgaggcc 
acccccgagg acttcccccg gcagctcgcc 
tttacccgcc tcgaggtccc cggctacgag 
aaggcggaaa aggaggggta cgaggtgcgc 
ctcgtctccg accgcgtcgc cgtcctccac 
ctttgggaga agtacggcct caggccggag 
gacccctccg acaacctccc cggggtcaag 
ctcaaggagt ggggaagcct ggaaaacctc 



540 



ctcaagaacc 


tgg^ccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctQcraQcr 

# ^ V-^ ^ ^ ^ ^ 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 




tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 




accqqqtqca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacc tcaag 




gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


rt rcrtaccccr 


aqqacqaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 




tqQcqcqqcq 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


r t" ttaactct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 




cracQcr era cert 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


d i— rrt n 

at. o cy o v_- 


^^3^33 ^3 3 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatcgcca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


X _7 o u 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2 040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


9 1 n 0 

^ -L U (J 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 



541 



gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 



caccac 



<210> 102 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



2520 
2526 



<400> 102 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 



480 



cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 60 0 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 



900 
960 
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gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


T o £r ri 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


13 2 0 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


T 7 n 
X J 0 U 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1 A A n 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


T c A n 
It) u u 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag gcaagcgctc 


caccagcgcc 


1 C A A 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcc t 


ccagcaccgg 


162 0 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagccucg l 


ccacccgagg 


n O A 

lb o U 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


^ggg^^agag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


192 0 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggaggcc 


atggccttca 


acatgcccgt 


ccagggcacc 


22 80 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 103 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 103 
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atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


9cggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa ggacctcaag gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


Qaacttaaqc 

" ^ ^ W Vfc ^ ^ « 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 



544 



<210> 104 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 104 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



1920 
1980 



tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2 52 0 



2526 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 



60 
120 
180 
240 
300 
360 
420 
480 



545 



cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


qacrcttaqqc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 
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gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2 52 0 



caccac 



<210> 105 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 



2526 



<400> 105 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 



600 



ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 78 0 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 



900 
960 
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gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 106 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 



548 



<400> 106 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


c^9g9tgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatcgcc 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 




caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


g^tgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 


<210> 107 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 107 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtgg^cggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


gg^ggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 
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ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


acc taaaaoc 


cctcctcaag 


660 


aacctggacc ggctgaagcc cgccatccgg gagaagatcc 


L.yyowvrfd^o. L- 


ggacgatctg 


720 


aag cue uc-c u 


gggacctggc 


caaggtgcgc 


accgacctgc 


L, y y cty y 


Qdpi p t* 1" cere c 


780 


— \ --i f-a~^ f \ 

aaaay y yy 


agcccgaccg ggagaggctt 


agggcctttc 


cggagagy c t. 


L.ydy u. c ^yy^ 


840 


/~sf~t /~i (--1 /~i 4~ 
dy t^v- UL-t^ Ut— 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 




aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 




ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


dddyo^k^ L-v^o. 


gggacctgaa ggaggcgcgg 


gggcttctcg ccaaagacct gagcgttctg 


1080 


gcccugaggg 


ddgg t^t^ *-y y 


Ov-< L.^^v-^y^^v^ 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


c uggaccc u u. 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


13 80 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


cacccc t tea 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


^ 4~ ^ ^ ^ 
cuucccycod 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggcccucc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagc ugddy d 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca ggctaagtag ctccgatccc 


1740 


aacctccaga 


acatccccgt 


cgccaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


199 0 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2 04 0 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 34 0 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 108 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 108 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 
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1560 
1620 



gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcaagaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 174 0 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 22 2 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 8 0 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 109 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 109 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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caccacctgg 


cctaccgcac 


cttccacgcc ctgaagggcc tcaccaccag 


ccggggggag 


120 




ccggtgcagg 


cggtctacgg 


cttcgccaag agcctcctca aggccctcaa 


ggaggacggg 


180 




gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct tccgccacga 


ggcctacgag 


240 




gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc cccggcaact 


cgccctcatc 


300 




aaggagctgg 


tggacctcct 


ggggctggcg cgcctcgagg tcccgggcta 


cgaggcggac 


360 




gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 


420 




accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca tccacgtcct 


ccaccccgag 


480 




gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc 


cgaccagtgg 


540 




gccgactacc 


gggccctgac 


cggggacgag tccgacaacc ttcccggggt 


caagggcatc 


600 






cggcgaggaa 


gcttctggag gagtggggga gcctggaagc 


cctcctcaag 


660 




aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc tggcccacat 


ggacgatctg 


720 




aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc ccctggaggt 


ggacttcgcc 


780 


ffl 


aaaaggcggg 


agcccgaccg 


ggagaggctt agggcctttc tggagaggct 


tgagtttggc 


840 


•k? 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg ccctggagga 


ggccccctgg 


900 




cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc gcaaggagcc 


catgtgggcc 


960 




gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtcc accgggcccc 


cgagccttat 


1020 




aaagccctca 


gggacctgaa 


ggaggcgcgg gggcttctcg ccaaagacct 


gagcgttctg 


1080 




gccctgaggg 


aaggccttgg 


cctcccgccc ggcgacgacc ccatgctcct 


cgcctacctc 


1140 




ctggaccctt 


cgaacaccac 


ccccgagggg gtggcccggc gctacggcgg 


ggagtggacg 


1200 




gaggaggcgg 


gggagcgggc 


cgccctttcc gagaggctct tcgccaacct 


gcttaagagg 


1260 




cttgaggggg 


aggagaggct 


cctttggctt taccgggagg tggagaggcc 


cctttccgct 


1320 




gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg tggcctatct 


cagggccttg 


1380 




tccctggagg 


tggccgagga 


gatcgcccgc ctcgaggccg aggtcttccg 


cctggccggc 


1440 




caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg tcctctttga 


cgagctaggg 


1500 




cttcccgcca 


tcggcaagac 


gcaaaagacc ggcaagcgct ccaccagcgc 


cgccgtcctg 


1560 




gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc tgcagtaccg 


ggagctcacc 


1620 




aagctgaaga 


gcacctacat 


tgaccccttg ccggacctca tccaccccag 


gacgggccgc 


1680 




ctccacaccc 


gcttcaacca 


gacggccacg gccacgggca ggctaagtag 


ctccgatccc 


1740 




aacctccaga 


acatccccgt 


. ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 


1800 
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gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


9cgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 


<210> 110 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 110 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


cgagacggag 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca ggctaagtag ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 
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gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



2460 
2517 



<210> 111 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 111 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 78 0 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 0 0 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 
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cttgaggggg 




cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggsgctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 


<210> 112 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 112 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 



gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 4 80 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

9999^9^^9^ cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cc tttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccaccagcgc cgccgtcctg 156 0 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 



559 



gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


qccaaqqaqc 


accaccacca 


ccaccac 


2517 


<210> 113 
<211> 2517 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 113 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccgqcaact 


cqccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcq 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 



560 



aaaaggcggg 


agcccgaccg 




3^gggCC t tt C 


tQcraQaQQCt 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


cc u L-Ccyy ad 


cty t-'V^ ^O-Ciyy 




aqccccctqq 


900 


cccccgccgg 


aaggggc c 1 1 


cy cyyy c u u u 


W ^ L' ^ ^ 


gc aaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgc caggggc 


ggccycy 


Clv— ^y y y ^ ^ ^ ^ 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgy 


rrrrcrr' t" 1" r i~ ca 

y ^ vj \^ L- ^ ^ ^ 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cc ucccgccc 


yyt^ycu^ycn^^ 


rratactcct 

\^ \^ ^ Vh^ ^ VhP^ ^ 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


g(^gy^^*-yy^ 


ap i" a caac aa 


QQaatQcracci 


1200 


gaggaggcgg 


gggagcgggc 


cgc cc u u L. c 


y ciy ciyy ^ l- 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggc t 


cc ccugg cc u 


cac-oyyy ciyy 


1" cro a cr acr a c c 


cctttccgct 


1320 


gtcctggccc 


auauggaggc 


cacgggygty 


p cr r* p 1" cf cf a c cr 


tggcctatct 


caqqgccttg 


1380 


tccctggagg 


tggccgagga 




y ^y y ^ ^ 3 


aggtctfcccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggacoag 




tcctctttga 


cgagctaagg 


1500 


cttcccaaga 


tcaacaagac 


gaagaagacc 


ggcaagcy c L- 


r^r^sr^pafTPOP 
t^ocn^ocxyv^y^^ 


caccatcctQ 

\^ ^ ^ ^ 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgacccc uug 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacy 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acauccccgu 




cttgggcaga ggatccgccg ggccttcatc 


1800 


gccgaggagg 


99tggctatt 


gguggccc uy 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 




caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 



561 



gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



2517 



<210> 114 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 114 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 8 0 



562 





tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaagg 


1500 




attcccaaga 


tcaagaagac 


gcataagacc 


ggtaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 




gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 




aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 




ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 




gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 




gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


m 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 




caggagctag 


ccatccctta 




caggccttca 


ttgagcgcta 


ctttcagagc 


2100 




ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 




gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 




ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 




ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 




gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 




gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 115 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 115 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacc tgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 



563 





gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 




aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 




gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 




accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 




gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 




gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 




ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 




aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 




aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 




aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


I 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


Si 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 




gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


|?-| 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 




gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 




ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 




gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 




cttgaggggg 


a-gga-ga.ggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 




gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 




tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 




cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 




gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 




aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccaq 


gacgggccgc 


1680 




ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 




gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 




gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgafcgcgc 


1980 



564 



cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 24 00 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



2160 
2220 
2280 
2340 



2517 



<210> 116 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 116 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 



180 
240 
300 
360 
420 
480 
540 
600 
660 
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agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 10 80 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

f-- tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 

If caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

'Z.^ cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccaccagcgc cgccctcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

Ir aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

-r™^; ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 18 00 

T gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 
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<210> 117 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 117 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



60 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccg aggggtcagg 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 



720 
780 



ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 0 0 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 13 20 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 
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t t-QQCOQQCC 


accccttcaa 


cctcaactcc 


cgggaccagc 


tgga9-3-gggt 


gctctttgac 


1500 




ttcccgcctt 




caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 




acfcrrTT^ta cd 

civj ^ \^ \„ l„ ct, 


cfCf a crcTP p p s p 

yy^^yy^^*^*^^ 


p p a t" pert era 


aaaaaa t ppt 


ccagcaccgg 


1620 




aap t" p a a cr;5 a 


papptapotcr 


aappppptpp 


paaacpt pat 


CC C CCS 3.00 


1680 




^ \^ C*. W *w w V-* y 


clr^ c Pierce* cio 


a paappapaa 

CLv-f y y v-' c*.^y y 


p c a p aaaaacr 

o. wy y y y cLy 


ac t fcaataac 


1740 


tccgacccca. 


acctgcagaa 


catccccgtc 


cgcaccccc t 


toaacpaaaa 


gat ccgccgg 


1800 


gccttcgtgg 


cccraQQCQOcr 

^ ^ 3 3 rJ ^ 


ttgggcgttg 




actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


caacraacaaa 


aacctgatca 


gggtCttCCa 


003.00003.3.0 

3y*=^yyrjy*-^*^ri 


1920 


gacatcc3ca 


CCCB.^B.CCgC 


aaactcraatcr 

73 ^ ^ ^ *^ ^ 


fc tcaacQtcc 


^ w \^ ^ ^ 


cataaacccc 


1980 


ctaatocacc 


3 y ^ ^ 3 y ^ v^o. t* 


aa paataaa p 


t tpaacatcc 


tctacaacat 


atccacccat 


2040 


tA.^^ ^ ^ 


acroacrd" tap 


pat'ppppt'ap 


y^yy*=*-yy^yy 


t acf p p 1 1 1 a t 


aaaapcTPt a p 


2100 


w CI Cl^J w 


t~ ppppa acfprt 

^ dCLy y l_ 


pf pprnrrp p i~ rrn 

y^yyy^'-'-yy 


a 1~ aaaa a a a a 
c*. t— ciy cici cicty o. 


ppp1~aaaaaa 

\^ ^ i_ y y ciy y o. 


arracra Prrra a a 

y y y y '-'•y y "^^-^y 


2160 


caacrartaccr 


tcraaaapppt 


pt~ t* pcrcraacra 


arjcrr'ant" a per 


tapppcrappt 


— . CI Ci- y ^ ^ ^ 


2220 


y i-y ciciy ciy L>y 


^ <=*^y y y dy y 


^y^yy*-*y*^y^ 


ci(-.yyi_-v_-i_. L-i^d 


ar^a'horTT'rTt' 
ct\_-CLU,yt^v^v^y L- 


<^v_a.yyyi^cik-v_ 


22 80 








;3 ;3 i~ I" 
ci ciy ^ i,^ L. u v_ ^ 


^^v_.tw*y(^v^ i_^t^y 


yy^ya-ugggg 


9 4. n 

<i J U 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












o c o cr 


<210> 118 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<22 3> Synthetic 












<400> 118 
atgaattccg 


^ggcgatgct 




^ ^ ^ ^ ^ 
y aacccaaag 


r-Y r~i /^/^/-^ "i^ jF-^ 4— 

y ccgggucc L. 


cctggtggac 


o U 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


^ggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 
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ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


L. cgay y t- c c 


L.yy L^dx-^y 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggagggg ua 


"-yy *-y ^y ^ 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


— 1 r(^(Tt~ n {~* 

cit^t-^y(— y y^ 


pert" p r* t* p c a c 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


dy ucLt^yyt^wi- 


p P3 crcTP per cracT 
^^"•yy ^^y y y 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg 




pacrcratcaaQ 


600 


ggcatcgggg 


agtataccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca 


ccgggcagca 


102 0 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


114 0 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


X W VJ 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1 m^n 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


± oZ U 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


J. D O U 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1 A n 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 
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ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 8 0 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 234 0 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 

<210> 119 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 119 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcaggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 78 0 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 
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gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg cccccgcccc 


r' 1" crcf a era a a 

t-y y ciy y ciy 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg tcctctcccg 


o (_ v-^y cty 1.^ v« 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca 


ccyy y i-ciy *-ct 


1020 


gaccccttgg 


cggggctaaa ggacctcaag gaggtccggg gcctcctcgc 


r*^7=i rrrra r'pi' c 


1080 


gccgtcttgg 


cc ucgaggy ci 


zd zj ^ i:d ^ ci\^ 


ctcgtgcccg gggacgaccc 




1140 


gcctacctcc 


cggaccccuc 




ca.y y y y y 3 ^ -3 


ccacgggggg 


12 00 


gagtggacgg 


aggacgccgc 


c c a cy y y c v_ 


cT\Tt rod aaaaactcca 

^ v.^ I— y ^ ^ >^ V— w 


tzcgg aacccu 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


C L. U Uggcccu ciu\-cn^ya.yy l. 


f~^/~^ ^ a a 3 f~% 

ggaaaay t-uv^ 


1320 


ctctcccggg 


tcctggccca 


ta-tggaggcc 


accyyy y i-ci<- yy y y ^ 


orrrT't" a pr*tt 

\_ l_> V.^ ^ 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc tcgaggagga 


yy L>k^^ i_^-»v_-y^-< 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc tggaaagggt 


yt^uoi^ui-yciv-. 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag gcaagcgctc 


caccagcgcc 


1 

XD D u 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg agaagatcct 


ccagcaccgg 


± o z u 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc caagcctcgt 


ccacccgagg 


J. D O W 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg ccacggggag 


gcttagtagc 


174 0 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct tgggccagag 


gatccgccgg 


180 0 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg actatagcca gatagagctc 


X O O \J 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca gggtcttcca 


ggaggggaag 


X _7 VJ 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc ccccggaggc 


cgtggacccc 


X _7 O VJ 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg tggcctttat 


agagcgctac 


^ X u w 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga ccctggagga 


ggggaggaag 


Z X D U 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg tgcccgacct 


caacgcccgg 


o o o n 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca acatgcccgt 


ccagggcacc 


O O O A 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg ccaagggtca 


ccaccaccac 


2520 
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2526 

caccac 

<210> 120 

<211> 2517 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 





<400> 120 
atgaattcgg 


ggatgctgcc 


cctctttgag cccaagggcc 


gggtcctcct 


ggtggacggc 


60 




caccacctgg 


cctaccgcac 


cttccacgcc ctgaagggcc 


tcaccaccag 


ccggggggag 


120 




ccggtgcagg 


cggtctacgg cttcgccaag agcctcctca aggccctcaa 


ggaggacggg 


180 




gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct 


tccgccacga 


ggcctacggg 


240 


m 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc 


cccggcaact 


cgccctcatc 


300 




aaggagctgg 


tggacctcct 


ggggttcacg cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 




gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 




accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca 


tccacgtcct 


ccaccccgag 


480 




gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


nj 


gccgactacc 


gggccctgac 


cgggga.cgag tccgacaacc 


ttcccggggt 


caagggcatc 


600 




ggggagaaga 


cggcgaggaa 


gcttctggag gagtggggga 


gcctggaagc 


cctcctcaag 


660 




aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 




aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc 


ccctggaggt 


ggacttcgcc 


780 




aaaaggcggg 


agcccgaccg 


ggagaggctt agggcctttc 


tggagaggct 


tgagtttggc 


840 




agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg 


ccctggagga 


ggccccctgg 


900 




cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 




gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 




aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct 


gagcgttctg 


1080 




gccctgaggg 


aaggccttgg 


cctcccgccc ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 




ctggaccctt 


cgaacaccac 


ccccgagggg gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 




gaggaggcgg 


gggagcgggc 


cgccctttcc gagaggctct 


tcgccaacct 


gcttaagagg 


1260 




cttgaggggg 


aggagaggct 


cctttggctt taccgggagg 


tggagaggcc 


cctttccgct 


1320 




gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg 


tggcctatct 


cagggccttg 


1380 
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tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 210 0 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 

<210> 121 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 121 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
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gggtacaagg 


cgggccgggc 


cgagacggag gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca gggacctgaa ggaggcgcgg gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc cgggtcttcc aggaggggcg ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc ccccgggagg 


[ ccgtggaccc 


: cctgatgcgc 


1980 
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gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caatggcatc 



cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 216 0 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2 34 0 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 122 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 122 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 



600 



ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 



720 



aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
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agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 




cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 




gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 




aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 




gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 




ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 




gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 




cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 




gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 




tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


^ ■• 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 




gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 




aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


T 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 




gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 




gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 




cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 




caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 




ttccccaagg 


tgcggg<^ctg 


gattgagaag 


accctggagg 


sgggcs-gg^g 


gcgggggtac 


2160 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 




gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 






^ggccaL.ggL. 


gaagc tct tc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


n T /I n 




ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 




gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 




gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<210> 123 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 123 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cgcagaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 
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caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 124 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 124 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 
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gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


naaccaatcjo 

1^ \^ ^ ^ 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


pa aaaacatcf 


600 


ggggagaaga 


cggggaggaa 


gcttctggag 


aacrtaQcrQQa 

t^-^ 33 — J -3 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


qaqaaqatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggs^cctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


acaaQcraqcc 


catatcrocrcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


ncip^ciccf' 1" a t* 

\^v_j ^ \^ i— ca. 


1020 


aaagccctca gggacctgaa ggaggcgcgg gggcttctcg 


ccaaagacct 


rr;^ cf r a1" 1" c t a 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


y 1— d- V — 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


qctacqqcqq 




1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


y ^ L- *— a.^y 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tqqaqaqqcc 




1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 




1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 




1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttcra 


rrra fTPl" HP CIO 

^y o-y ^ *-^^y 3 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 




cgccy L-ot- uy 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagct cacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


O \^ \^ <^ \^ \^ \^ w ^^^3 


ycH-yyyv-v^y^ 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


crcTPi" ?3 acrtacr 


4- /"* i^/^a -f- 
CCOt-ydH-t— 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


r^r^t~* y-t +- 4- /"t a 

gg c c u u c-ci u 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 
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2220 
2280 
2340 
2400 
2460 
2517 



caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 

<210> 125 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 125 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 36 0 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 



600 
660 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaata cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 



900 



cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 



580 



gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1 ri o n 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


T n Q n 
± u o u 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


J_ ±^ u 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


12 60 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


T y) /I A 
144 (J 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


T IT r» A 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


168 0 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


18 00 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2 040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 126 
<211> 2517 
<212> DNA 



581 



<213> Artificial 
<220> 

<223> Synthetic 



<400> 126 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 




cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca gggacctgaa ggaggcgcgg gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 



582 



gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 18 60 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 04 0 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 8 0 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2 517 

<210> 127 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 127 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 



360 



gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 



583 



accgccgaca aagaccttta 


ccagctcctt tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc 


cgaccagtgg 


540 


gccgactacc gggccctgac 


cggggacgag tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga cggcgaggaa 


gcttctggag gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc ggctgaagcc 


cgccatccgg gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct gggacctggc 


caaggtgcgc accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg agcccgaccg 


ggagaggctt agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc acgagttcgg 


ccttctggaa agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg aaggggcctt 


cgtgggcttt gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg ccctggccgc 


cgccaggggc ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca gggacctgaa ggaggcgcgg gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg aaggccttgg 


cctcccgccc ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt cgaacaccac 


ccccgagggg gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg gggagcgggc 


cgccctttcc gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg aggagaggct 


cctttggctt taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc atatggaggc 


cacgggggtg cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg tggccgagga 


gatcgcccgc ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca acctcaactc 


ccgggaccag ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt tgaagaagac gaagaagacc ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga gcacctacat 


tgaccccttg ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc gcttcaacca 


gacggccacg gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga acatccccgt 


ccgcaccccg cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct ccggcgacga 


gaacctgatc cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg ccagctggat 


gttcggcgtc ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca agaccatcaa 


cttcggggtc ctctacggca 


tgtcggccca 


cgccctctcc 


2040 


caggagctag ccatccctta 


cgaggaggcc caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg tgcgggcctg 


gattgagaag accctggagg agggcaggag gcgggggtac 


2160 



584 



gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



2400 
2460 
2517 



<210> 128 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 128 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



300 
360 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 6 00 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 



900 
960 



gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 



585 



gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc 


tccagcaccg ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 129 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 



586 



420 
480 



<223> Synthetic 
<400> 129 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 18 0 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtcca ccgggccccc 102 0 

gagccttata aagccctcag ggacctgaag gaggcgcggg ggcttctcgc caaagacctg 1080 

agcgttctgg ccctgaggga aggccttggc ctcccgcccg gcgacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcccggcg ctacggcggg 12 00 

gagtggacgg aggaggcggg ggagcgggcc gccctttccg agaggctctt cgccaacctg 1260 

cttaagaggc ttgaggggga ggagaggctc ctttggcttt accgggaggt ggagaggccc 132 0 

ctttccgctg tcctggccca tatggaggcc acgggggtgc gcctggacgt ggcctatctc 1380 

agggccttgt ccctggaggt ggccgaggag atcgcccgcc tcgaggccga ggtcttccgc 144 0 

ctggccggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt cctctttgac 1500 

gagctagggc ttcccgccat caagaagacg caaaagaccg gcaagcgctc caccagcgcc 1560 

gccgtcctgg aggccctccg cgaggcccac cccatcgtgg agaagatcct gcagtaccgg 162 0 
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gagctcacca agctgaagag cacctacatt gaccccttgc cggacctcat ccaccccagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacgggcag gctaagtagc 1740 

tccgatccca acctccagaa catccccgtc cgcaccccgc ttgggcagag gatccgccgg 1800 

gccttcatcg ccgaggaggg gtggctattg gtggccctgg actatagcca gatagagctc 186 0 

^gggtgctgg cccacctctc cggcgacgag aacctgatcc gggtcttcca ggaggggcgg 1920 

gacatccaca cggagaccgc cagctggatg ttcggcgtcc cccgggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gaccatcaac ttcggggtcc tctacggcat gtcggcccac 2 04 0 

cgcctctccc aggagctagc catcccttac gaggaggccc aggccttcat tgagcgctac 2100 

tttcagagct tccccaaggt gcgggcctgg attgagaaga ccctggagga gggcaggagg 2160 

cgggggtacg tggagaccct cttcggccgc cgccgctacg tgccagacct agaggcccgg 222 0 

gtgaagagcg tgcgggaggc ggccgagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct ggctatggtg aagctcttcc ccaggctgga ggaaatgggg 2340 

gccaggatgc tccttcaggt cgccaacgag ctggtcctcg aggccccaaa agagagggcg 24 0 0 

gaggccgtgg cccggctggc caaggaggtc atggaggggg tgtatcccct ggccgtgccc 24 60 

ctggaggtgg aggtggggat aggggaggac tggctctccg ccaaggagca ccaccaccac 2 52 0 

caccac 2526 

<210> 130 
<211> 2508 
<212> DMA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 130 

atggaattca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 120 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 3 00 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 3 60 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 420 

^cgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 
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gggaccctgg 


tcaccccaaa 


ggacguccag gagaaguacg yyy uy^^^-^wu-u yya.yv-y*-'*-^^ 


540 


gtggacttcc 


gcgccctcac 


gggggaccgc ucggacaaca uccccgyyyu yyv^yyyy<^^<=>- 


600 


ggggagaaga 


ccgcccttcg 


— ^ 4- >-i *- y-t rr^ni-r^rmrfri^ ncnf'CICH^ririPi CCtCCtQaaCT 

accccncgca yayuyyyyya. yt-y uyycLcxdo. *^*^i_\H-v^*-^<-*tAvj 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc cggcgcaaga uayciyyuyuci i^i^uv^ydyya.^^ 


720 


ctccacctct 


ccttagacct 


ggcccgcatc cgcaccgacc ccccccuyyct yy uyycu-L-uu 


780 


aaggccctgc 


gccgcaggac 


CCCCgacCtg gagggccuya ggycL-ut-uuL. yyo-yya-y^uy 


840 


gagttcggaa 


gcctcctcca 


cgagtucggc ccccugggag ygyciyciciyut- '-^-yyyciyya.vj 


900 


gccccctggc 


ccccgcccga 


aggggccucc gcgggcui-cc ucc-cuut-ut^y t^cLciyyca.'-j*^'^'^ 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg gcctcggagg gccggguct^d t^v^yyy^^cicn-a. 


1020 


agcccggttg 


aggccctggc 


cgaccucaag gaggcccggg yyuuoouyyu. \^cLc^.yyca.^^v^^y 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc ccggacccca cyyctuycioi^^- i^i^uv^L-^-y^^-y 


1140 


gcctacctcc 


tggacccggc 


caacacccac cccgaggggg tggcccggcg ccdL-yyyyyu 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc ctcctctccg agaggctctt ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg ctctaccagg aggtggagcg gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg gtgaggctgg acgtccccct tctggaggcc 


1380 


ctctcctttg 


agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacgcagaag acgggcaagc gctccaccgc ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc gtggagctca tcctccagta ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc ctgccccggc tcgtccaccc gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc acggccacgg gaaggctttc cagctccgac 


174 0 


cccaacctgc 


agaacatccc 


cgtgcgcacc cccttggggc agcgcatccg caaggccttc 


X o u u 


gtggccgagg 


aggggtggct 


ccttttggcg gcggactact cccagattga gctccgggtc 


J. O D U 


ctggcccacc 


tctcggggga 


cgagaacctg aagcgggtct tccgggaggg gaaggacatc 




cataccgaga 


ccgccgcctg 


gatgttcggc ttagaccccg ctctggtgga tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc gtcctctacg ggatgtccgc ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag gcggaggcct ttattgagcg ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa aggaccctgg aggagggccg gacgcggggc 


2160 
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2508 



tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 2220 

tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 2 2 80 

gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 2340 

ctcctcctcc aagtgcacaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 24 0 0 

gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 

gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggattga 



<210> 131 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 131 



atggaattcc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


y eg a Coy '-y y 


■h(~ri~ t~ "1" n 3 fT r* 
L.y u i— L.ycn^y^-^ 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctgggaggga 


agggtttacc 


gggcggagga 


tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 
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ctgagggaag ggattgccct ggcaccgggc gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 


1200 


gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct gaagcggctt 


1260 


^agggggagg aga.yyv^ cuyyv-L-uua.^- '^^^^y-^riij^-rpi) 


aaaagcccct 


ttcgcgggtc 


1320 


ccggcccaca ugydyyuv-d.^^ yyyyy *-*='^'-yy 


cctacttaaa 


ggccctttcc 


1380 


cuggagguyy a-yy^yy«-y<^*- ctciyy^y^->-i-*-' ^'■^zjzj^zjzj^j^ 


tccaccgcct 


ggccgggcat 


1440 


r-< -h I- 1- r- 3 r- "hrr;^ 3 rt" OCS3iCC3.0CXiQ QaaaOQQtCa 


tctttgacga 


gcttgggctt 


1500 


CCCgCCauCg gca.a.ycn-.yt-ci yctcaycu^yyy^ cici.vj^^vj>^ 


ccagcgccgc 


cgttttggag 


1560 


^i-«*^-t--i->^*^*-irT(^ ar^rrr*"!- na i" rr* 1~ PCrt'Cra'ac CCTCatCCttC 
yCCUUyCyyy a.yyc i-i— k^o. uv-y »-y^ci.v-. 


aqtaccggga 


gctttccaag 


1620 


ctcaagggaa cct.acat.cya tt-t-uttyot-t ^^^k^^k^^^'^^'^'^ 


accccaagac 


gaaccgcctc 


1680 


cacacccytt toctciov.-ciyci.»^ yyi^\^o.*w-v^^v-'^ '-'■'^^^^^"•rj^^ 


ttagcagctc 


ggatcctaat 


1740 


ctgcaaaata tccccgtgcg ca.ccc^u ttty yy-^ciy^yyci 


tccgccgggc 


cttcgtggcc 


1800 


gaggaggggt ggaggctggt ggttttggac tacagccaga 


ttgagctcag 


ggtcctggcg 


1860 


cacctttccg gggacgagaa cctaatccgg gtcttccagg 


agggccagga 


catccacacc 


1920 


cagacggcca gctggatgtt cggcgtgccc ccagaggccg 


tggattccct 


gatgcgccgg 


1 Q R n 

X -? o u 


gcggccaaga ccatcaactt cggcgtcctc tacggcatgt 


ccgcccaccg 


gctttcggga 


2040 


gagctggcca tcccctacga ggaggcggtg gccttcatcg 


agcggtattt 


ccagagctac 


2100 


cccaaggtgc gggcctggat tgagaaaacc ctggcggaag 


gacgggaacg 


gggctatgtg 


2160 


gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 


2220 


cgggaggcag cggagcgcat ggccttcaac atgccggtcc 


aggggaccgc 


cgcggatttg 


2280 


atgaaactgg ccatggtgaa gctctttccc aggcttcagg 


agctgggggc 


caggatgctt 


2340 


ttgcaggtgc acaacgaact ggtcctcgag gctcccaagg 


agcaagcgga 


ggaagtcgcc 


2400 


caggaggcca agcggaccat ggaggaggtg tggcccctga 


aggtgccctt 


ggaggtggaa 


2460 


gtgggcatcg gggaggactg gctttccgcc aaggcctag 






2499 



<210> 132 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
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<400> 132 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 360 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 42 0 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 60 0 

ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 720 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 840 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 90 0 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 
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acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 
caccac 



<210> 133 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 133 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 



2040 
2100 
2160 



2280 
2340 
2400 
2460 



2526 



ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 3 60 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 42 0 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 



60 
12 0 
180 
240 
300 



480 
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ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 540 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 72 0 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 

cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 840 

ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 90 0 

ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 

cttaacgcct tggccgccgc ctggggcggc cgcgtgcacc gggcagcaga ccccttggcg 102 0 

gggctaaagg acctcaagga ggtccggggc ctcctcgcca aggacctcgc cgtcttggcc 1080 

tcgagggagg ggctagacct cgtgcccggg gacgacccca tgctcctcgc ctacctcctg 1140 

gacccttcga acaccacccc cgagggggtg gcgcggcgct acggggggga gtggacggag 1200 

gacgccgccc accgggccct cctctcggag aggctccatc ggaacctcct taagcgcctc 1260 

gagggggagg agaagctcct ttggctctac cacgaggtgg aaaagcccct ctcccgggtc 132 0 

ctggcccata tggaggccac cggggtacgg cgggacgtgg cctaccttca ggccctttcc 1380 

ctggagcttg cggaggagat ccgccgcctc gaggaggagg tcttccgctt ggcgggccac 144 0 

cccttcaacc tcaactcccg ggaccagctg gaaagggtgc tctttgacga gcttaggctt 1500 

cccgccttga agaagacgaa gaagacaggc aagcgctcca ccagcgccgc ggtgctggag 1560 

gccctacggg aggcccaccc catcgtggag aagatcctcc agcaccggga gctcaccaag 162 0 

ctcaagaaca cctacgtgga ccccctccca agcctcgtcc acccgaggac gggccgcctc 1680 

cacacccgct tcaaccagac ggccacggcc acggggaggc ttagtagctc cgaccccaac 1740 

ctgcagaaca tccccgtccg cacccccttg ggccagagga tccgccgggc cttcgtggcc 1800 

gaggcgggtt gggcgttggt ggccctggac tatagccaga tagagctccg cgtcctcgcc 1860 

cacctctccg gggacgaaaa cctgatcagg gtcttccagg aggggaagga catccacacc 1920 

cagaccgcaa gctggatgtt cggcgtcccc ccggaggccg tggaccccct gatgcgccgg 1980 

gcggccaaga cggtgaactt cggcgtcctc tacggcatgt ccgcccatag gctctcccag 2040 

gagcttgcca tcccctacga ggaggcggtg gcctttatag agcgctactt ccaaagcttc 2100 

cccaaggtgc gggcctggat agaaaagacc ctggaggagg ggaggaagcg gggctacgtg 2160 

gaaaccctct tcggaagaag gcgctacgtg cccgacctca acgcccgggt gaagagcgtc 2220 
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agggaggccg cggagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 2280 

atgaagctcg ccatggtgaa gctcttcccc cgcctccggg agatgggggc ccgcatgctc 2340 

ctccaggtcg ccaacgagct cctcctggag gccccccaag cgcgggccga ggaggtggcg 2400 

gctttggcca aggaggccat ggagaaggcc tatcccctcg ccgtgcccct ggaggtggag 2460 

gtggggatgg gggaggactg gctttccgcc aagggtcacc accaccacca ccac 2514 

<210> 134 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 134 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 3 60 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 420 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 54 0 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 

ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 72 0 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 840 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtcca ccgggccccc 102 0 

gagccttata aagccctcag ggacctgaag gaggcgcggg ggcttctcgc caaagacctg 10 8 0 
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agcgttctgg ccctgaggga aggccttggc 

gcctacctcc tggacccttc gaacaccacc 

gagtggacgg aggaggcggg ggagcgggcc 

cttaagaggc ttgaggggga ggagaggctc 

ctttccgctg tcctggccca tatggaggcc 

agggccttgt ccctggaggt ggccgaggag 

ctggccggcc accccttcaa cctcaactcc 

gagctagggc ttcccgccat caagaagacg 

gccgtcctgg aggccctccg cgaggcccac 

gagctcacca agctgaagag cacctacatt 

& acgggccgcc tccacacccg cttcaaccag 

ffi tccgatccca acctccagaa catccccgtc 

jF gccttcatcg ccgaggaggg gtggctattg 

agggtgctgg cccacctctc cggcgacgag 

gacatccaca cggagaccgc cagctggatg 

D ctgatgcgcc gggcggccaa gaccatcaac 

ft cgcctctccc aggagctagc catcccttac 

i tttcagagct tccccaaggt gcgggcctgg 

cgggggtacg tggagaccct cttcggccgc 

gtgaagagcg tgcgggaggc ggccgagcgc 

gccgccgacc tcatgaagct ggctatggtg 

gccaggatgc tccttcaggt cgccaacgag 

gaggccgtgg cccggctggc caaggaggtc 

ctggaggtgg aggtggggat aggggaggac 
caccac 

<210> 135 
<211> 2514 
<212> DNA 
<213> Artificial 

<220> 



ctcccgcccg gcgacgaccc catgctcctc 1140 

cccgaggggg tggcccggcg ctacggcggg 1200 

gccctttccg agaggctctt cgccaacctg 1260 

ctttggcttt accgggaggt ggagaggccc 132 0 

acgggggtgc gcctggacgt ggcctatctc 1380 

atcgcccgcc tcgaggccga ggtcttccgc 144 0 

cgggaccagc tggaaagggt cctctttgac 1500 

caaaagaccg gcaagcgctc caccagcgcc 1560 

cccatcgtgg agaagatcct gcagtaccgg 162 0 

gaccccttgc cggacctcat ccaccccagg 1680 

acggccacgg ccacgggcag gctaagtagc 1740 

cgcaccccgc ttgggcagag gatccgccgg 180 0 

gtggccctgg actatagcca gatagagctc 1860 

aacctgatcc gggtcttcca ggaggggcgg 192 0 

ttcggcgtcc cccgggaggc cgtggacccc 198 0 

ttcggggtcc tctacggcat gtcggcccac 2040 

gaggaggccc aggccttcat tgagcgctac 210 0 

attgagaaga ccctggagga gggcaggagg 2160 

cgccgctacg tgccagacct agaggcccgg 222 0 

atggccttca acatgcccgt ccagggcacc 2280 

aagctcttcc ccaggctgga ggaaatgggg 234 0 

ctggtcctcg aggccccaaa agagagggcg 24 0 0 

atggaggggg tgtatcccct ggccgtgccc 2460 

tggctctccg ccaaggagca ccaccaccac 252 0 

2526 
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<223> Synthetic 














<400> 135 
atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 




ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 




caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 




gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 




aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 




atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 




ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 




gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 




ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


■§ 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg gcgtgaaggg 


catcggggag 


600 


"if"" 


•aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 




gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 




ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 




cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


r^'i '; 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 




ccccccgagg gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 




cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtccacc 


gggcccccga 


gccttataaa 


1020 




gccctcaggg 


acctgaagga 


ggcgcggggg 


cttctcgcca 


aagacctgag 


cgttctggcc 


1080 




ctgagggaag 


gccttggcct 


cccgcccggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 




gacccttcga 


acaccacccc 


cgagggggtg gcccggcgct 


acggcgggga 


gtggacggag 


1200 




gaggcggggg 


agcgggccgc 


cctttccgag 


aggctcttcg 


ccaacctgct 


taagaggctt 


1260 




gagggggagg 


agaggctcct 


ttggctttac 


cgggaggtgg 


agaggcccct 


ttccgctgtc 


1320 




ctggcccata 


tggaggccac 


gggggtgcgc 


ctggacgtgg 


cctatctcag 


ggccttgtcc 


1380 




ctggaggtgg 


ccgaggagat 


cgcccgcctc 


gaggccgagg 


tcttccgcct 


ggccggccac 






cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtcc 


tctttgacga 


gctagggctt 


1500 




cccgccatca 


agaagacgca 


aaagaccggc 


aagcgctcca 


ccagcgccgc 


cgtcctggag 


1560 




gccctccgcg 


aggcccaccc 


catcgtggag 


aagatcctgc 


agtaccggga 


gctcaccaag 


1620 
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ctgaagagca 


cctacattga 


ccccttgccg 


gacctcatcc 


accccaggac 


qqqccqcctc 


1680 


ca.ca.ccccfct 


tcaaccagac 


ggc cacggc c 


aCQaqcaqqc 


taagtagctc 


cgatcccaac 


1740 


ctccagaaca. 


tccccgtccg 


caccccgctt 


qqqcaqaqqa 


tccqccqqqc 


cttcatcgcc 


1800 




QQCtattQQt 


ggccctggac 


tatagccaga 


taqaqctcaq 


qqtqctqqcc 


1860 


cacctctccg 


Qcqacqaqaa 


cctgatccgg 


qtcttccagq 


aqgqqcqgqa 


catccacacg 


1920 


gagaccgcca 


gctggatgtt 


cggcgtcccc 


cqqqaqqccq 

*-'!333 -33 -3 


tggaccccct 


qatqcqccqq 


1980 


gcggccaaga 


ccatcaactt 


cqqqqtcctc 


tacggcatgt 


cggcccaccg 


cctctcccag 


2040 


gage t age ca 


tcccttacga 


ggaggc c c ag 


gccttcattg 


^gcgctactt 


tcagagcttc 


2100 


cccaaggtgc 


aqqcctqaat 


tgagaagacc 


Ctqqaqqaqq 


qcaqqaqqcq 


qqqqtacqtq 


2160 


gagaccctct 


tcqqccqccq 

^^3_3^^-3^^13 


ccgctacgtg 


ccagacctag 


aqqcccqqqt 


qaaqaqcqtq 


2220 


CQQQacrcrcQQ 


ccgagcgcat 


ggccttcaac 


atgcccgtcc 


agggcaccgc 


cgccgacctc 


2280 


atgaagctgg 


ctatggtgaa 


gctcttcccc 


aggctggagg 


aaatgggggc 


caggatgctc 


2340 


cttcaggtcg 


ccaacgagct 


ggtcctcgag 


gccccaaaag 


^g^gggcgga 


ggccgtggcc 


2400 


cggctggcca 


aggaggtcat 


ggagggggtg 


tatcccctgg 


ccgtgcccct 


ggaggtggag 


2460 


gtggggatag 


gggaggactg 


gctctccgcc 


aaggagcacc 


accaccacca 


ccac 


2514 



<210> 136 

<211> 320 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 136 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg lie Asn Ser Gly 
15 10 15 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
20 25 30 

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
35 40 45 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
50 55 60 

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie Val Val Phe Asp 
65 70 75 80 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala 
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85 



90 



95 



Gly Arg Ala Pro Thr Leu Val Pro Arg Gly Ser Glu Asp Phe Pro Arg 
100 105 110 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg 
115 120 125 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala 
130 135 140 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
145 150 155 160 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro 
165 170 175 

Glu Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
180 185 190 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
195 200 205 

Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys 
210 215 220 

Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
225 230 235 240 

Arg Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp 
245 250 255 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
260 265 270 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
275 280 285 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
290 295 300 

Leu Leu Glu Ser Pro Lys Ala Ala Leu Glu His His His His His His 
305 310 315 320 

<210> 137 

<211> 73 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 137 

ccctatcttt aaagttttta aaaagtttga cccccttttg ggggccctat ctttaaagtt 60 
tttaaaaatt tga 73 
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<210> 138 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 138 
cgcgcggaac gcgcg 



<210> 139 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 139 
cccgggtttt cccggg 



<210> 140 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 140 

aggcgcacca atttggtgtt 



<210> 


141 


<211> 


53 


<212> 


RNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


141 



uucgcuuucu ucccuuccuu ucucgccacg uucgccggcu uuccccguca age 



<210> 142 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 142 

acggggaaag ccggcgaacg tggcgagaaa 



<210> 143 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 143 

attagaaagg aagggaagaa agcgaa 

<210> 144 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 144 

acggggaaag ccggcgaacg tggcgagaac 



<210> 145 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 145 

cttgacgggg aaagccggcg aacgtggcgc 



<210> 146 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 146 

agaaaggaag ggaagaaagc gaa 



<210> 147 

<211> 63 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 147 

gcgcggcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgc 
cgc 



<210> 148 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 148 

gcgctagggc gctggcaagt gtagcggtca 



<210> 149 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 149 

gatcgctgcg cgtaaccacc acacccgcgc cgcgc 



<210> 150 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 150 

ggactctgcc tcaagacggt agtcaacgtg 



<210> 151 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 151 
cacgttgact accgtc 
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<210> 152 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 152 

catgtcaagc agtcctaact ttgaggcaga gtcc 



<210> 153 

<211> 2506 

<212> DNA 

<213> Thermus aquaticus 



34 



<400> 153 

atgaggggga tgctgcccct ctttgagccc aagggccggg tcctcctggt ggacggccac 60 

cacctggcct accgcacctt ccacgccctg aagggcctca ccaccagccg gggggagccg 12 0 

gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctcaagga ggacggggac 18 0 

gcggtgatcg tggtctttga cgccaaggcc ccctccttcc gccacgaggc ctacgggggg 240 

tacaaggcgg gccgggcccc cacgccggag gactttcccc ggcaactcgc cctcatcaag 300 

gagctggtgg acctcctggg gctggcgcgc ctcgaggtcc cgggctacga ggcggacgac 3 60 

gtcctggcca gcctggccaa gaaggcggaa aaggagggct acgaggtccg catcctcacc 42 0 

gccgacaaag acctttacca gctcctttcc gaccgcatcc acgtcctcca ccccgagggg 480 

tacctcatca ccccggcctg gctttgggaa aagtacggcc tgaggcccga ccagtgggcc 540 

gactaccggg ccctgaccgg ggacgagtcc gacaaccttc ccggggtcaa gggcatcggg 600 

gagaagacgg cgaggaagct tctggaggag tgggggagcc tggaagccct cctcaagaac 660 

ctggaccggc tgaagcccgc catccgggag aagatcctgg cccacatgga cgatctgaag 72 0 

ctctcctggg acctggccaa ggtgcgcacc gacctgcccc tggaggtgga cttcgccaaa 78 0 

aggcgggagc ccgaccggga gaggcttagg gcctttctgg agaggcttga gtttggcagc 84 0 

ctcctccacg agttcggcct tctggaaagc cccaaggccc tggaggaggc cccctggccc 900 

ccgccggaag gggccttcgt gggctttgtg ctttcccgca aggagcccat gtgggccgat 960 

cttctggccc tggccgccgc cagggggggc cgggtccacc gggcccccga gccttataaa 1020 

gccctcaggg acctgaagga ggcgcggggg cttctcgcca aagacctgag cgttctggcc 108 0 

ctgagggaag gccttggcct cccgcccggc gacgacccca tgctcctcgc ctacctcctg 1140 

gacccttcca acaccacccc cgagggggtg gcccggcgct acggcgggga gtggacggag 1200 
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gaggcggggg agcgggccgc cctttccgag aggctcttcg ccaacctgtg ggggaggctt 12 60 
gagggggagg agaggctcct ttggctttac cgggaggtgg agaggcccct ttccgctgtc 132 0 
ctggcccaca tggaggccac gggggtgcgc ctggacgtgg cctatctcag ggccttgtcc 
ctggaggtgg ccgaggagat cgcccgcctc gaggccgagg tcttccgcct ggccggccac 
cccttcaacc tcaactcccg ggaccagctg gaaagggtcc tctttgacga gctagggctt 
cccgccatcg gcaagacgga gaagaccggc aagcgctcca ccagcgccgc cgtcctggag 
gccctccgcg aggcccaccc catcgtggag aagatcctgc agtaccggga gctcaccaag 
ctgaagagca cctacattga ccccttgccg gacctcatcc accccaggac gggccgcctc 
cacacccgct tcaaccagac ggccacggcc acgggcaggc taagtagctc cgatcccaac 
ctccagaaca tccccgtccg caccccgctt gggcagagga tccgccgggc cttcatcgcc 
gaggaggggt ggctattggt ggccctggac tatagccaga tagagctcag ggtgctggcc 
cacctctccg gcgacgagaa cctgatccgg gtcttccagg aggggcggga catccacacg 
gagaccgcca gctggatgtt cggcgtcccc cgggaggccg tggaccccct gatgcgccgg 
gcggccaaga ccatcaactt cggggtcctc tacggcatgt cggcccaccg cctctcccag 
gagctagcca tcccttacga ggaggcccag gccttcattg agcgctactt tcagagcttc 
cccaaggtgc gggcctggat tgagaagacc ctggaggagg gcaggaggcg ggggtacgtg 
gagaccctct tcggccgccg ccgctacgtg ccagacctag aggcccgggt gaagagcgtg 
cgggaggcgg ccgagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 
atgaagctgg ctatggtgaa gctcttcccc aggctggagg aaatgggggc caggatgctc 
cttcaggtcc acgacgagct ggtcctcgag gccccaaaag agagggcgga ggccgtggcc 
cggctggcca aggaggtcat ggagggggtg tatcccctgg ccgtgcccct ggaggtggag 
gtggggatag gggaggactg gctctccgcc aaggagtgat accacc 



<210> 154 

<211> 2496 

<212> DNA 

<213> Thermus flavus 



<400> 154 

atggcgatgc ttcccctctt tgagcccaaa ggccgcgtgc tcctggtgga cggccaccac 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2506 



60 

ctggcctacc gcaccttctt tgccctcaag ggcctcacca ccagccgcgg cgaacccgtt 120 



caggcggtct acggcttcgc caaaagcctc ctcaaggccc tgaaggagga cggggacgtg 180 
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gtggtggtgg 


tctttgacgc 


caaggccccc 


tccttccgcc 


acgaggccta 


cgaggcctac 


24 U 


aaggcgggcc 


gggcccccac 


cccggaggac 


tttccccggc 


agctggccct 


catcaaggag 


o A n 


ttggtggacc 


tcctaggcct 


tgtgcggctg 


gaggttcccg 


gctttgaggc 


ggacgacgtg 


360 


ctggccaccc 


tggccaagcg 


ggcggaaaag 


gaggggtacg 


aggtgcgcat 


cctcactgcc 


420 


gaccgcgacc 


tctaccagct 


cctttcggag 


cgcatcgcca 


tcctccaccc 


tgaggggtac 


480 


ctgatcaccc 


cggcgtggct 


ttacgagaag 


tacggcctgc 


gcccggagca 


gtgggtggac 


540 


taccgggccc 


tggcggggga 


cccctcggat 


aacatccccg 


gggtgaaggg 


catcggggag 


600 


aagaccgccc 


agaggctcat 


ccgcgagtgg 


gggagcctgg 


aaaacctctt 


ccagcacctg 


bb u 


gaccaggtga 


agccctcctt 


gcgggagaag 


ctccaggcgg 


gcatggaggc 


cctggccctt 


72 0 


tcccggaagc 


tttcccaggt 


gcacactgac 


ctgcccctgg 


aggtggactt 


cgggaggcgc 


780 


cgcacaccca 


acctggaggg 


tctgcgggct 


tttttggagc 


ggttggagtt 


tggaagcctc 


84 0 


ctccacgagt 


tcggcctcct 


ggaggggccg 


aaggcggcag 


aggaggcccc 


ctggccccct 


900 


ccggaagggg 


cttttttggg 


cttttccttt 


tcccgtcccg 


agcccatgtg 


ggccgagctt 


960 


ctggccctgg 


ctggggcgtg 


ggaggggcgc 


ctccatcggg 


cacaagaccc 


ccttaggggc 


1020 


ctgagggacc 


ttaagggggt 


gcggggaatc 


ctggccaagg 


acctggcggt 


tttggccctg 


1080 


cgggagggcc 


tggacctctt 


cccagaggac 


gaccccatgc 


tcctggccta 


ccttctggac 


1140 


ccctccaaca 


ccacccctga 


gggggtggcc 


cggcgttacg 


ggggggagtg 


gacggaggat 


1200 


gcgggggaga 


gggccctcct 


ggccgagcgc 


ctcttccaga 


ccctaaagga 


gcgccttaag 


1260 


ggagaagaac 


gcctgctttg 


gctttacgag 


gaggtggaga 


agccgctttc 


ccgggtgttg 


1320 


gcccggatgg 


aggccacggg 


ggtccggctg 


gacgtggcct 


acctccaggc 


cctctccctg 


1380 


gaggtggagg 


cggaggtgcg ccagctggag gaggaggtct 


tccgcctggc 


cggccacccc 


1440 


ttcaacctca 


actcccgcga 


ccagctggag 


cgggtgctct 


ttgacgagct 


gggcctgcct 


1500 


gccatcggca 


agacggagaa 


gacggggaaa 


cgctccacca 


gcgctgccgt 


gctggaggcc 


1560 


ctgcgagagg 


cccaccccat 


cgtggaccgc 


atcctgcagt 


accgggagct 


caccaagctc 


1620 


aagaacacct 


acatagaccc 


cctgcccgcc 


ctggtccacc 


ccaagaccgg 


ccggctccac 


1680 


acccgcttca 


accagacggc 


caccgccacg 


ggcaggcttt 


ccagctccga 


ccccaacctg 


1740 


cagaacatcc 


ccgtgcgcac 


ccctctgggc 


cagcgcatcc 


gccgagcctt 


cgtggccgag 


1800 


gagggctggg 


tgctggtggt 


cttggactac 


agccagattg 


agcttcgggt 


cctggcccac 


1860 
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ctctccgggg acgagaacct gatccgggtc tttcaggagg ggagggacat ccacacccag 
accgccagct ggatgttcgg cgtttccccc gaaggggtag accctctgat gcgccgggcg 
gccaagacca tcaacttcgg ggtgctctac ggcatgtccg cccaccgcct ctccggggag 



ctttccatcc 


cctacgagga 


ggcggtggcc 


ttcattgagc 


gctacttcca 


gagctacccc 


2100 


aaggtgcggg 


cctggattga 


ggggaccctc 


gaggagggcc 


gccggcgggg 


gtatgtggag 


2160 


accctcttcg 


gccgccggcg 


ctatgtgccc 


gacctcaacg 


cccgggtgaa 


gagcgtgcgc 


2220 


gaggcggcgg 


agcgcatggc 


cttcaacatg 


ccggtccagg 


gcaccgccgc 


cgacctcatg 


2280 


aagctggcca 


tggtgcggct 


tttcccccgg 


cttcaggaac 


tgggggcgag 


gatgcttttg 


2340 


caggtgcacg 


acgagctggt 


cctcgaggcc 


cccaaggacc 


gggcggagag ggtagccgct 


2400 


ttggccaagg 


aggtcatgga gggggtctgg 


cccctgcagg tgcccctgga ggtggaggtg 


2460 


ggcctggggg 


aggactggct 


ctccgccaag gagtag 






2496 


<210> 155 
<211> 2505 
<212> DNA 

<213> Thermus thermophilus 










<400> 155 
atggaggcga 


tgcttccgct 


ctttgaaccc 


aaaggccggg 


tcctcctggt 


ggacggccac 


60 


cacctggcct 


accgcacctt 


cttcgccctg 


aagggcctca 


ccacgagccg 


gggcgaaccg 


120 


gtgcaggcgg 


tctacggctt 


cgccaagagc 


ctcctcaagg 


ccctgaagga 


ggacgggtac 


180 


aaggccgtct 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cggggagggc 


cccgaccccc 


gaggacttcc 


cccggcagct 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggtttacc 


cgcctcgagg 


tccccggcta 


cgaggcggac 


360 


gacgttctcg 


ccaccctggc 


caagaaggcg 


gaaaaggagg 


ggtacgaggt 


gcgcatcctc 


420 


accgccgacc 


gcgacctcta 


ccaactcgtc 


tccgaccgcg 


tcgccgtcct 


ccaccccgag 


480 


ggccacctca 


tcaccccgga 


gtggctttgg 


gagaagtacg 


gcctcaggcc 


ggagcagtgg 


540 


gtggacttcc 


gcgccctcgt 


gggggacccc 


tccgacaacc 


tccccggggt 


caagggcatc 


600 


ggggagaaga 


ccgccctcaa 


gctcctcaag 


gagtggggaa 


gcctggaaaa 


cctcctcaag 


660 


aacctggacc 


gggtaaagcc 


agaaaacgtc 


cgggagaaga 


tcaaggccca 


cctggaagac 


720 


ctcaggctct 


ccttggagct 


ctcccgggtg 


cgcaccgacc 


tccccctgga ggtggacctc 


780 


gcccaggggc 


gggagcccga 


ccgggagggg 


cttagggcct 


tcctggagag 


gctggagttc 


840 
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ggcagcctcc 


tccacgagtt cggcctcctg 


gaggcccccg 


cccccctgga 


ggaggccccc 


900 


tggcccccgc 


cggaaggggc cttcgtgggc 


ttcgtcctct 


cccgccccga 


gcccatgtgg 


960 


gcggagctta 


aagccctggc cgcctgcagg 


gacggccggg 


tgcaccgggc 


agcagacccc 


1020 


ttggcggggc 


taaaggacct caaggaggtc 


cggggcctcc 


tcgccaagga 


cctcgccgtc 


1080 


ttggcctcga 


gggaggggct agacctcgtg 


cccggggacg 


accccatgct 


cctcgcctac 


1140 


ctcctggacc 


cctccaacac cacccccgag 


ggggtggcgc 


ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg ggccctcctc 


tcggagaggc 


tccatcggaa 


cctccttaag 


1260 


cgcctcgagg 


gggaggagaa gctcctttgg 


ctctaccacg 


aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccacatgga ggccaccggg gtacggctgg acgtggccta 


ccttcaggcc 


1380 


ctttccctgg agcttgcgga ggagatccgc 


cgcctcgagg aggaggtctt 


ccgcttggcg 


1440 


ggccacccct 


tcaacctcaa ctcccgggac 


cagctggaaa gggtgctctt 


tgacgagctt 


1500 


aggcttcccg 


ccttggggaa gacgcaaaag 


acaggcaagc 


gctccaccag 


cgccgcggtg 


1560 


ctggaggccc 


tacgggaggc ccaccccatc 


gtggagaaga 


tcctccagca 


ccgggagctc 


1620 


accaagctca 


agaacaccta cgtggacccc 


ctcccaagcc 


tcgtccaccc 


gaggacgggc 


1680 


cgcctccaca 


cccgcttcaa ccagacggcc 


acggccacgg ggaggcttag tagctccgac 


1740 


cccaacctgc 


agaacatccc cgtccgcacc 


cccttgggcc 


agaggatccg 


ccgggccttc 


1800 


gtggccgagg 


cgggttgggc gttggtggcc 


ctggactata gccagataga gctccgcgtc 


1860 


ctcgcccacc 


tctccgggga cgaaaacctg 


atcagggtct 


tccaggaggg 


gaaggacatc 


1920 


cacacccaga 


ccgcaagctg gatgttcggc 


gtccccccgg 


aggccgtgga 


ccccctgatg 


1980 


cgccgggcgg 


ccaagacggt gaacttcggc 


gtcctctacg 


gcatgtccgc 


ccataggctc 


2040 


tcccaggagc 


ttgccatccc ctacgaggag gcggtggcct 


ttatagagcg 


ctacttccaa 


2100 


agcttcccca 


aggtgcgggc ctggatagaa 


aagaccctgg 


aggaggggag 


gaagcggggc 


2160 


tacgtggaaa 


ccctcttcgg aagaaggcgc 


tacgtgcccg 


acctcaacgc 


ccgggtgaag 


2220 


agcgtcaggg 


aggccgcgga gcgcatggcc 


ttcaacatgc 


ccgtccaggg 


caccgccgcc 


2280 


gacctcatga 


agctcgccat ggtgaagctc 


ttcccccgcc 


tccgggagat 


gggggcccgc 


2340 


atgctcctcc 


aggtccacga cgagctcctc 


ctggaggccc 


cccaagcgcg 


ggccgaggag 


2400 


gtggcggctt 


tggccaagga ggccatggag 


aaggcctatc 


ccctcgccgt 


gcccctggag 


2460 


gtggaggtgg ggatggggga ggactggctt tccgccaagg gttag 




2505 
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<210> 156 

<211> 2502 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 



<222> (1) . 
<223> n is 


. (2502) 
any of the 


bases g, a 


, t, or c 








<400> 156 
atgnnggcga 


tgcttcccct 


ctttgagccc 


aaaggccggg 


tcctcctggt 


ggacggccac 


60 


cacctggcct 


accgcacctt 


cttcgccctg 


aagggcctca 


ccaccagccg 


gggcgaaccg 


120 


gtgcaggcgg 


tctacggctt 


cgccaagagc 


ctcctcaagg 


ccctgaagga 


ggacggggac 


180 


nnggcggtgn tcgtggtctt tgacgccaag gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggactttc 


cccggcagct 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggcttgcg 


cgcctcgagg 


tccccggcta 


cgaggcggac 


360 


gacgtnctgg 


ccaccctggc 


caagaaggcg 


gaaaaggagg 


ggtacgaggt 


gcgcatcctc 


420 


accgccgacc 


gcgacctcta 


ccagctcctt 


tccgaccgca 


tcgccgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc gtggctttgg gagaagtacg gcctgaggcc 


ggagcagtgg 


540 


gtggactacc 


gggccctggc 


gggggacccc 


tccgacaacc 


tccccggggt 


caagggcatc 


600 


ggggagaaga 


ccgcccngaa 


gctcctcnag 


gagtggggga 


gcctggaaaa 


cctcctcaag 


660 


aacctggacc 


gggtgaagcc 


cgccntccgg gagaagatcc aggcccacat ggangacctg 


720 


angctctcct 


gggagctntc 


ccaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aagnggcggg 


agcccgaccg 


ggaggggctt 


agggcctttc 


tggagaggct 


ggagtttggc 


840 


agcctcctcc 


acgagttcgg 


cctcctggag 


ggccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtcctttccc 


gccccgagcc 


catgtgggcc 


960 


gagcttctgg 


ccctggccgc 


cgccagggag 


ggccgggtcc 


accgggcacc 


agaccccttt 


1020 


angggcctna 


gggacctnaa 


ggaggtgcgg 


ggnctcctcg 


ccaaggacct 


ggccgttttg 


1080 


gccctgaggg 


agggcctnga 


cctcntgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggacccct 


ccaacaccac 


ccccgagggg 


gtggcccggc 


gctacggggg 


ggagtggacg 


1200 


gaggangcgg 


gggagcgggc 


cctcctntcc 


gagaggctct 


tccngaacct 


nnngcagcgc 


1260 
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cttgaggggg aggagaggct cctttggctt taccaggagg tggagaagcc cctttcccgg 132 0 
gtcctggccc acatggaggc cacgggggtn cggctggacg tggcctacct ccaggccctn 13 8 0 
tccctggagg tggcggagga gatccgccgc ctcgaggagg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagctnggg 
cttcccgcca tcggcaagac ggagaagacn ggcaagcgct ccaccagcgc cgccgtgctg 
gaggccctnc gngaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 
aagctcaaga acacctacat ngaccccctg ccngncctcg tccaccccag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggcttagtag ctccgacccc 
aacctgcaga acatccccgt ccgcaccccn ctgggccaga ggatccgccg ggccttcgtg 
gccgaggagg gntgggtgtt ggtggccctg gactatagcc agatagagct ccgggtcctg 
gcccacctct ccggggacga gaacctgatc cgggtcttcc aggaggggag ggacatccac 
acccagaccg ccagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtccgccca ccgcctctcc 
caggagcttg ccatccccta cgaggaggcg gtggccttca ttgagcgcta cttccagagc 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 
gtggagaccc tcttcggccg ccggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 
gtgcgggagg cggcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 
ctcatgaagc tggccatggt gaagctcttc ccccggctnc aggaaatggg ggccaggatg 
ctcctncagg tccacgacga gctggtcctc gaggccccca aagagcgggc ggaggnggtg 
gccgctttgg ccaaggaggt catggagggg gtctatcccc tggccgtgcc cctggaggtg 24 60 
gaggtgggga tgggggagga ctggctctcc gccaaggagt ag 2502 

<210> 157 
<211> 832 
<212> PRT 

<213> Thermus aquaticus 
<400> 157 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
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35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val 
50 SB 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu 
195 200 205 

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 215 220 

Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys 
225 230 235 240 

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
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340 345 350 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 

355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 
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Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 



Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

<210> 158 
<211> 831 
<212> PRT 

<213> Thermus flavus 
<400> 158 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
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85 



90 95 



Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ala He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Gin Arg Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 
210 215 220 

Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 240 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
275 280 285 

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gin Asp 
325 330 335 

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly He Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro 
355 360 365 

Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
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385 390 395 400 

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gin Thr Leu Lys 
405 410 415 

Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val 
435 440 445 

Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Glu Ala 
450 455 460 

Glu Val Arg Gin Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser 
500 505 510 

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val 
515 520 525 

Asp Arg lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu 
595 600 605 

Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gly Glu Leu Ser He Pro Tyr Glu Glu Ala 
675 680 685 

Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg Ala 
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690 

Trp lie Glu Gly 
705 

Thr Leu Phe Gly 



Lys Ser Val Arg 
740 

Gin Gly Thr Ala 
755 

Pro Arg Leu Gin 
770 

Glu Leu Val Leu 
785 

Leu Ala Lys Glu 



Glu Val Glu Val 
820 



695 

Thr Leu Glu Glu 
710 

Arg Arg Arg Tyr 
725 

Glu Ala Ala Glu 



Ala Asp Leu Met 
760 

Glu Leu Gly Ala 
775 

Glu Ala Pro Lys 
790 

Val Met Glu Gly 
805 

Gly Leu Gly Glu 



700 

Gly Arg Arg Arg 
715 

Val Pro Asp Leu 
730 

Arg Met Ala Phe 
745 

Lys Leu Ala Met 



Arg Met Leu Leu 
780 

Asp Arg Ala Glu 
795 

Val Trp Pro Leu 
810 

Asp Trp Leu Ser 
825 



Gly Tyr Val Glu 
720 

Asn Ala Arg Val 
735 

Asn Met Pro Val 
750 

Val Arg Leu Phe 
765 

Gin Val His Asp 



Arg Val Ala Ala 
800 

Gin Val Pro Leu 
815 

Ala Lys Glu 
830 



<210> 159 

<211> 833 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<220> 

<221> X 

<222> (1)..{833) 

<223> X is any amino acid 



<400> 159 

Met Xaa Ala Met Leu Pro Leu Phe 
1 5 

Val Asp Gly His His Leu Ala Tyr 
20 

Leu Thr Thr Ser Arg Gly Glu Pro 
35 40 

Lys Ser Leu Leu Lys Ala Leu Lys 
50 55 

Val Phe Asp Ala Lys Ala Pro Ser 
65 70 



Glu Pro Lys Gly Arg Val Leu Leu 
10 15 

Arg Thr Phe Phe Ala Leu Lys Gly 
25 30 

Val Gin Ala Val Tyr Gly Phe Ala 

45 

Glu Asp Gly Asp Ala Val Xaa Val 
60 

Phe Arg His Glu Ala Tyr Glu Ala 
75 80 



615 



Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 ^5 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg He Ala Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Glu Gin Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Xaa Lys Leu Leu 
195 200 205 

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val 
210 215 220 

Lys Pro Xaa Xaa Arg Glu Lys He Xaa Ala His Met Glu Asp Leu Xaa 
225 230 235 240 

Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa 
325 330 335 

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 



616 



Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 



Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu 
405 410 415 

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Xaa Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 



617 



Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 



Ala Trp lie Glu Lys 
705 

Glu Thr Leu Phe Gly 
725 

Val Lys Ser Val Arg 
740 

Val Gin Gly Thr Ala 
755 

Phe Pro Arg Leu Xaa 
770 

Asp Glu Leu Val Leu 
785 

Ala Leu Ala Lys Glu 
805 

Leu Glu Val Glu Val 
820 

Xaa 



<210> 160 
<211> 640 
<212> RNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 160 

gggagcccag cuaugaacuc cuucuccaca agcgccuucg guccaguugc cuucucccug 60 

gggcugcucc ugguguugcc ugcugccuuc ccugccccag uacccccagg agaagauucc 120 

aaagauguag ccgccccaca cagacagcca cucaccucuu cagaacgaau ugacaaacaa 18 0 

auucgguaca uccucgacgg caucucagcc cugagaaagg agacauguaa caagaguaac 240 

augugugaaa gcagcaaaga ggcacuggca gaaaacaacc ugaaccuucc aaagauggcu 3 00 

gaaaaagaug gaugcuucca aucuggauuc aaugaggaga cuugccuggu gaaaaucauc 3 60 

^cuggucuuu uggaguuuga gguauaccua gaguaccucc agaacagauu ugagaguagu 42 0 

gaggaacaag ccagagcugu ccagaugagu acaaaagucc ugauccaguu ccugcagaaa 4 80 

aaggcaaaga aucuagaugc aauaaccacc ccugacccaa ccacaaaugc cagccugcug 540 



Thr Leu Glu Glu Gly 
710 

Arg Arg Arg Tyr Val 
730 

Glu Ala Ala Glu Arg 
745 

Ala Asp Leu Met Lys 
760 

Glu Met Gly Ala Arg 
775 

Glu Ala Pro Lys Xaa 
790 

Val Met Glu Gly Val 
810 

Gly Xaa Gly Glu Asp 
825 



Arg Arg Arg Gly Tyr Val 
715 720 

Pro Asp Leu Asn Ala Arg 
735 

Met Ala Phe Asn Met Pro 
750 

Leu Ala Met Val Lys Leu 
765 

Met Leu Leu Gin Val His 
780 

Arg Ala Glu Xaa Val Ala 
795 800 

Tyr Pro Leu Ala Val Pro 
815 

Trp Leu Ser Ala Lys Glu 
830 



618 



acgaagcugc aggcacagaa ccaguggcug caggacauga caacucaucu cauucugcgc 600 
agcuuuaagg aguuccugca guccagccug agggcucuuc 64 0 



<210> 161 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 161 

agggagaagg caactggacc gaaggcc 2 7 



<210> 162 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) . . (1) 

<223> TH 5' end has a fluorescein label. 



<220> 

<221> n 

<222> (35) . . (35) 

<223> TH 3' end is modified with a dideoxynucleotide , 



<400> 162 

ncgaaattaa tacgcttgtg gagaaggagt tcatn 35 



<210> 163 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 163 

gctatgaact ccttctccac aagcgccttc ggtccagttg ccttctccct ggg 53 



<210> 164 

<211> 214 

<212> PRT 

<213> Artificial 
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<220> 

<223> Synthetic 



<400> 164 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
15 10 15 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
20 25 30 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
35 40 45 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
50 55 60 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
65 70 75 80 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
85 90 95 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
100 105 110 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys 
115 120 125 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
130 135 140 

Ala His Pro lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys 
145 150 155 160 

Leu Lys Ser Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg 
165 170 175 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
180 185 190 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
195 200 205 

Pro Leu Gly Gin Arg lie 
210 

<210> 165 

<211> 214 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 165 



6Z0 



Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
1 5 10 15 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
20 25 30 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
35 40 45 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
50 55 60 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
65 70 75 80 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
85 90 95 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
100 105 110 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
115 120 125 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
130 135 140 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
145 150 155 160 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
165 170 175 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
180 185 190 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
195 200 205 

Pro Leu Gly Gin Arg He 
210 

<210> 166 

<211> 37 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5* end is modified with a biotin-streptavitin complex. 



621 



<400> 166 

nucacggcag uuggugcgcc ucggaacgag gcgcacg 



37 



<210> 167 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) , . (1) 

<223> TH 5' end is labeled with tetrachlororf luorescein . 



<400> 167 

nttttcaact gccgtga 17 



<210> 168 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 168 

tcacggcagt tggtgcgcct cggaacgagg cgcacg 3 6 



<210> 169 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 169 

cgccgagatc acctttacat tttctatcgt 3 0 



<210> 170 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 170 

ccttccttat cctggatctt ggca 24 



622 



<210> 171 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 171 

acgatagaaa atgtaaaggt gate 



<210> 172 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (4) . . (4) 

<223> The residue at this position is a z2 8 linker group. 



<400> 172 
ctcnttctca gtgcg 



<210> 173 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 173 

cgcagtgaga atgaggtgat ctcggcggt 



<210> 174 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 174 

ccgccgagat cacggatgtt gtaatcagag a 



<210> 175 
<211> 21 
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<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 175 

gtgcagggtt gactccttct c 



<210> 176 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 176 

gtgcagggtt gactctttct c 



<210> 177 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 177 

gtgcagggtc gactctttct c 



<210> 178 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 178 

tctctgatta caacatccgt gatct 



<210> 179 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 179 

cgccgagatc acgtagttga ggtcaatga 



<210> 180 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 180 

gaatcatact ggaacatgta gaccatc 



<210> 181 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 181 

tcattgacct caactacgtg atct 



<210> 182 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 182 

ccgccgagat cacgatgatc ttgaggct 



<210> 183 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 183 

tggtgcagga ggcattgctc 



<210> 184 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 184 

cagcctcaag attaccgtga tct 



<210> 185 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 185 

ccgtcacgcc tcctccacgg etc 



<210> 186 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 186 

aggcgaaagc cctcaatttc cca 



<210> 187 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 187 
aaccactgcc gcaca 



<210> 188 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 188 
gagccgtgga ggaggcg 



<210> 189 

<211> 14 

<212> DNA 

<213> Artificial 



<220> 
<223> 



Synthetic 



<220> 

<221> n 

<222> (4).. (4) 

<223> The residue at this position is a z28 linker group. 



<400> 189 
cacntgcttc gtgg 



<210> 190 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 190 

ccaggaagca agtggaggcg tgacggt 



<210> 191 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 191 

ccgtcacgcc tccttcggag tttggg 



<210> 192 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 192 

gggttgtgga gtgagtgttc aagta 



<210> 193 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 193 

gggaaactcc gaaggaggcg 



<210> 194 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 194 

ccgtcacgcc tctctgactg cca 



<210> 195 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 195 

ttgtcactcg gggttcgaga agatgaa 



<210> 196 

<211> 11 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 196 
gggccagagg g 

<210> 197 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 197 
aggcagtcag agaggcg 



<210> 198 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 198 

ccgtcacgcc tcctcctcat tgaatt 



<210> 199 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 199 

ccaaaagtcc agtgatgatt ttcaccaggc aagta 



<210> 200 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 200 

cagattggaa gcatccatct 



<210> 201 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 201 

gattcaatga ggaggaggc 



<210> 202 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 202 

ccgtcacgcc tccatctgtt tagg 



<210> 203 
<211> 22 



<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 203 

caggtcctgg aaggagcact ta 



<210> 204 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 204 

gccatcagct tctttgttct tgtcatc 



<210> 205 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 205 

gccctaaaca gatggaggcg 

<210> 206 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 206 

ccgtcacgcc tcctccagtt gtag 



<210> 207 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 207 

aaaatcatct gtaaatccag cagtaaatga 



<210> 208 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 208 

ctgtgttttc tttgtagaac 



<210> 209 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 209 
ctacaactgg aggaggc 



<210> 210 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 210 

ccgtcacgcc tcctctcagt tct 



<210> 211 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 211 

gtgtggtcca ctctcaatca a 



<210> 212 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 



<220> 

<221> modif ied_base 

<222> (1) . , (1) 

<223> The residue at tH position contains a TET-label. 



<400> 212 

attagaaagg aagggaagaa agcgaa 26 



<210> 213 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 213 

gcttgacggg gaaagccggc gaacgtggcg 3 0 



<210> 214 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 214 

cttgacgggg aaagccggcg aacgtggcga 3 0 



<210> 215 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 215 

tgacggggaa agccggcgaa cgtggcgaga 3 0 



<210> 216 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 216 

^^^gggg^aag ccggcgaacg tggcgagaaa 3 0 



632 



<210> 217 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 217 

ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca age 53 



<210> 218 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (1) 

<223> The residue at tH positions contains a fluorescein label. 



<400> 218 

tttccctcct cctcttcc 18 



<210> 219 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 219 

acacagtgtc ctcccgctcc tcctgagcaa 3 0 



<210> 220 

<211> 54 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 220 

atgaggaaga ggaggagggt gctcaggagg agcgggagga cactgtgtct gtca 54 



<210> 221 
<211> 840 



633 



<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 221 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
IS 10 1^ 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys 



634 



260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 



635 



565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 222 

<211> 2520 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 222 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


D \J 


ggccaccacc 


tggcctaccg 


tacctttttt 


gccctgaagg 


gcctcaccac 


cagccggggg 


xz u 


gagccggtcc 


aggcggtgta 


cgggtttgcc 


aagagccttt 


tgaaggcgct 


aagagaagac 


IPO 
J. o u 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


gaggccccct 


ccttccgcca 


cgaggcctac 


^^\J 


ggggggtaca 


aggcggggcg 


ggctcccacc 


cccgaggact 


ttccccggca 




3 00 


atcaaggagc 


tggtggacct 


cctggggttt 


acccgcctcg 


aggtccccgg 


cuacgaggcg 


O O VJ 


gacgacgttc 


tcgccaccct 


ggccaagaag 


gcggaaaagg 


aggggtacga 


ggtgcgcatc 




ctcaccgccg 


acaaagacct 


ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


4 o U 


gaggggtacc 


tcatcacccc 


ggcctggctt 


tgggaaaagt 


acggcctgag 


gcccgaccag 


A Pi 


tgggccgact 


accgggccct 


gaccggggac 


gagtccgaca 


accttcccgg 


ggtcaagggc 


A Pi 
D U U 


atcggggaga 


agaccgccct 


caagctcctc 


aaggagtggg 


ggagcctgga 


agccctcctc 


£1 {Z r\ 
o o U 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 


no r\ 

f A\J 


ctgaagctct 


cctgggacct 


ggccaaggtg 


cgcaccgacc 


tgcccctgga 


ggtggacttc 


n o n 


gccaaaaggc 


gggagcccga 


ccgggagggg 


cttaaggcct 


ttttggagag gctggagttc 


Q A r\ 
o4 U 


ggcagcctcc 


tccacgagtt 


cggcctcctg 


ggaggggaga 


agccccggga 


ggaggccccc 


900 


tggcccccgc 


cggaaggggc 


cttcgtgggc 


tttgtgcttt 


cccgcaagga 


gcccatgtgg 


Q n 
y b U 


gccgatcttc 


tggccctggc 


cgcctgcagg 


ggcggccgcg 


tgcaccgggc 


agcagacccc 


102 0 


ttggcggggc 


taaaggacct 


caaggaggtc 


cggggcctcc 


tcgccaagga 


cctcgccgtc 


1080 


ttggcctcga gggaggggct 


agacctcgtg 


cccggggacg 


accccatgct 


cctcgcctac 


1140 


ctcctggacc 


cttcgaacac 


cacccccgag 


ggggtggcgc 


ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg 


ggccctcctc 


tcggagaggc 


tccatcggaa 


cctccttaag 


1260 


cgcctcgagg 


gggaggagaa 


gctcctttgg 


ctctaccacg 


aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccatatgga 


ggccaccggg 


gtacggcggg 


acgtggccta 


ccttcaggcc 


1380 


ctttccctgg 


agcttgcgga ggagatccgc 


cgcctcgagg 


aggaggtctt 


ccgcttggcg 


1440 


ggccacccct 


tcaacctcaa 


ctcccgggac 


cagctggaaa gggtgctctt 


tgacgagctt 


1500 


aggcttcccg 


ccttgaagaa 


gacgaagaag 


acaggcaagc 


gctccaccag 


cgccgcggtg 


1560 



637 



ctggaggccc 


tacgggaggc 


ccaccccatc 


gtggagaaga 


tcctccagca 


ccgggagctc 


accaagctca 


agaacaccta 


cgtggacccc 


ctcccaagcc 


tcgtccaccc 


gaggacgggc 


cgcctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


ggaggcttag 


tagctccgac 


cccaacctgc 


agaacatccc 


cgtccqcacc 


cccttgggcc 


agaggatccg 


ccgggccttc 


gtggccgagg 


cgggttgggc 


gttggtggcc 


ctggactata 


gccagataga 


gctccgcgtc 


ctcgcccacc 


tctccgggga 


cgaaaacctg 


atcagggtct 


tccaggaggg 


gaaggacatc 


cacacccaga 


ccgcaagctg 


gatgttcggc 


gtccccccgg 


aggccgtgga 


ccccctgatg 


cgccgggcgg 


ccaagacggt 


gaacttcggc 


gtcctctacg 


gcatgtccgc 


ccataggctc 


tcccaggagc 


ttgccatccc 


ctacgaggag 


gcggtggcct 


ttatagagcg 


ctacttccaa 


agcttcccca 


^9gtgcgggc 


ctggatagaa 


aagaccctgg 


^ggaggggag 


gaagcggggc 


tacgtggaaa 


ccctcttcgg 


aagaaggcgc 


tacgtgcccg 


acctcaacgc 


ccgggtgaag 


agcgtcaggg 


aggccgcgga 


gcgcatggcc 


ttcaacatgc 


ccgtccaggg 


caccgccgcc 


gacctcatga 


agctcgccat 


ggtgaagctc 


ttcccccgcc 


tccgggagat 


gggggcccgc 


atgctcctcc 


aggtcgccaa 


cgagctcctc 


ctggaggccc 


cccaagcgcg 


ggccgaggag 


gtggcggctt 


tggccaagga 


ggccatggag 


aaggcctatc 


ccctcgccgt 


gcccctggag 


gtggaggtgg 


ggatggggga 


ggactggctt 


tccgccaagg 


gtcaccacca 


ccaccaccac 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



<210> 223 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) . . (1) 

<223> TH 5' end has a fluorescein label. 



<220> 

<221> n 

<222> (6) . . (6) 

<22 3> The residue at tH position is a cy3 abasic linker group. 



<400> 223 

ncgctntctc gctcgc 16 
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<210> 224 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 224 

"i fl 

acggaacgag cgtctttg 



<210> 225 

<211> 32 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 225 

gcgagcgaga cagcgaaaga cgcucguucc gu 



<210> 226 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 226 

gcgagcgaga cagcgaaaga cgctcgttcc gt 



<210> 227 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 227 

acggaacgag cgtctttcat ctgtcaatc 



<210> 228 

<211> 26 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 228 
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ucacggcagu uggugcggaa cgcacg 



26 



<210> 229 

<211> 26 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 229 

tcacggcagt tggtgcggaa cgcacg 



<210> 230 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (30) . . (30) 

<223> TH 3' end is modified with an amine moiety . 



<400> 230 

cggaggaagc agttggtgcg cctcgttaan 



<210> 231 

<211> 23 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5' end is labeled with fluorescein. 



<400> 231 

ntccttctca actgcttcct ccg 



<210> 232 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 



<220> 

<221> n 

<222> (28).. (28) 

<223> TH 3' end is modified with a biotin moiety. 



<400> 232 

aacgaggcgc acctcaaatc tccctttn 



<210> 233 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) . . (1) 

<223> TH 5' end is labeled with fluorescein. 



<400> 233 

nagcgagaca gcgaaagacg ctcgtt 



<210> 234 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) . . (1) 

<223> TH 5' end is labeled with fluorescein. 



<400> 234 
nttttcgctg tctcgct 



<210> 235 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 235 
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acgagcgtct ttg 



13 



<210> 236 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 236 

cacgaattcg gggatgctgc ccctctttga gcccaa 



<210> 237 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 237 

gtgagatcta tcactccttg gcggagagcc agtc 



<210> 238 

<211> 2502 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 238 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



36 



34 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 



60 
120 
180 
240 
300 
360 
420 
480 



600 
660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 






aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


7 8 0 




aaaaggcggg 


agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 


o *± u 




agcctcctcc 


acgagt tcgg 


U H.^ L.y y ctd 


^5 cr p p r* p a aacf 


ccctggagga 


ggccccctgg 


Q n n 




cccccgccgg 


aaggggcctt 


eg uggy c u c 


rrirrP'h'h'hPPP 


^ d. v^^j Z3 


catqtQQQCC 






gatcttctgg 


ccctggccgc 


cgccaggggg 


gyccyyy 


^ ^ ^ ^ 


cgagccttat 


T AO n 
X uz u 




aaagccctca 


gggacctgaa 


99a99cgcgg 


yyyt^ ^'^^^y 


ppa aaaacc t 


aaacattctq 


1 n ft n 
X u o u 




gccctgaggg 


aaggccttgg 


CCCCCCy CCC 




p p a t" ac t c c t 


cgcctacctc 


X -L VJ 




ctggaccctt 


ccaacaccac 


ccccgagggg 


rrl- nnp P PPTPTP 

guyguoL-yy 




aqaqtaqacq 


1 9 n fi 

XZ VJ u 




gaggaggcgg 


gggagcgggc 


/-*■/-! +— +~ 1^ 

cgcccu cccc 


/T^narrf^pl" pt" 
y dyciy y uv- u 


tcgccaacct 


QtQQqqqaqq 


xz o u 




cttgaggggg 


aggagaggct 


^ ^ 4~ ^ v^y^/^ ^ 

ccL-uuggcc u 


uaccgygayy 


*-yy *^y "-y y ^ 


cctttccgct 


T "3 o n 

X J z u 


"f I 


gtcctggccc 


acatggaggc 


cacgggggtg 


cgcctggacg 


^ nr^ r* r*\~ ^ t" pi~ 
L.yyo*--t-clL.^— 


p a rrcra p p t fc q 


1 7 ft ri 
X J o u 


IP 


tccctggagg 


tggccgggga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1 A A n 
x^ ^ u 




caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


T c; Pi n 
X-> u u 


TV i; 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


'i H 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


N= 


aagctgaaga gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


Xo o U 




ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 




aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga ggatccgccg ggccttcatc 


n Q A A 




gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


T o A 




gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 




acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 




cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2 040 




caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 




ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 




gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 




gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccggggcac 


cgccgccgac 


2280 




ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 
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ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 240 0 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 
gaggtgggga taggggagga ctggctctcc gccaaggagt ga 2502 



<210> 239 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 239 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
SO 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 



645 



515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
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Glu 



820 



825 



830 



<210> 240 

<211> 28 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 240 

cacgaattcc gaggcgatgc ttccgctc 



<210> 241 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 241 

tcgacgtcga ctaacccttg gcggaaagcc 



<210> 242 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 242 

gcatcgcctc ggaattcatg gtc 



<210> 243 

<211> 836 

<212> PRT 

<213> Thermus thermophilus 

<400> 243 

Met Asn Ser Glu Ala Met Leu Pro 
1 5 

Leu Leu Val Asp Gly His His Leu 
20 

Lys Gly Leu Thr Thr Ser Arg Gly 
35 40 



Leu Phe Glu Pro Lys Gly Arg Val 
10 15 

Ala Tyr Arg Thr Phe Phe Ala Leu 
25 30 

Glu Pro Val Gin Ala Val Tyr Gly 
45 
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Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 



Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 
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Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 244 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 244 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 4 80 
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cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 60 0 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 66 0 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccqcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

^cgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 198 0 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

^^gctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
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cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacgacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511 



<210> 245 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 245 

atagccatgg tggagcggcc gctctcccgg 3 0 



<210> 246 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 246 

aagcgtcgac tcaatcctgc ttcgcctcca gcc 33 

<210> 247 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 247 

aatcgaattc accccacttt ttgacctgga gg 32 



<210> 248 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 248 

cc999^9^9^ ggccgctcca c 



21 



<210> 249 
<211> 2508 
<212> DMA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 249 

atggaattca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 18 0 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 3 00 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 3 60 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 42 0 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 54 0 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 

ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 66 0 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 72 0 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 840 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcggagg gccgggtcca ccgggcaaca 102 0 

agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 1080 

gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 1140 

gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 12 0 0 

gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 12 60 

tttccccggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc 132 0 
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cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


ggcctcaccc 


cggtgggccg 


gacggagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg 


caaggccttc 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga 


gctccgggtc 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg 


ctacttccag 


agcttcccca 


^9gtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg 


gacgcggggc 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc 


ccgggtccgc 


tcggtgcggg 


aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg 


caccgccgcc 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct 


gggggcccac 


ctcctcctcc 


aagtgcacga 


cgagctggtc 


ctggaggtgc 


ccgaggaccg 


ggccgaggag 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt 


gcccctcgag 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggattga 





<210> 250 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 250 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2508 
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Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 
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His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Pro Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Glu Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 
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His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 251 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 251 

actggaattc ctgcccctct ttgagcccaa g 



<210> 252 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 

<400> 252 

aacagtcgac ctaggccttg gcggaaagcc 3 0 

<210> 253 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 





<400> 253 
atggaattcc 


tgcccctctt 


tgagcccaag 


qaccaqatqc 


ttctqqtqqa 


cggccaccac 


60 




ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcqq 


ggagccggtc 


120 


fr 


caggcggtgt 


acQQcrtfctQC 


caagagcct t 


ttQaaqacqc 


taagggaaga 


cqqgqatqtg 


180 


;rcS! 


ataatccrtacr 


tgtt tgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cqaqqcctac 


240 






aocrc t p c c a c 

^ ^ W ^ 'w w 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 




y y ^y y 


f" i" t" "f" fTfTrr f~ 


y y ^y ^y ^ ^ ^ 


y<=iyy ^-y t-^-yy 


Qpt t tcfaaQC 


ggatgacgtc 


360 




yy L-cLw^w-o 


i" nrrr' r* ;5 ^3 era 3 
L-yyt^i^-ctctycici. 


yyVi—yyctdctcty 




dCL^ ^ 1^ C3. 


cctcaccgcg 


420 


Sj 


n ^ f~> rtr^ci^ c 


L-l— L-CtV_,^CtyL-.L- 


uv^uL-L-k^yyciy 


V- y d ci 1— v_, l-vj^^ci 


<_ v_, v_ i_ cx ^ \_- 


y y "-y yy 


480 




L,yciut^ctv^t..o 


rfn^ n i~ rrrr +" 

^yy^y *-yy^ *- 


t, (-y yy cty dcty 


L-dL-yyyoL-i-ci. 




y ^yyy '-yy 


540 




cciv_ i_.y yy (^v_ u 


cggccgggga 


/-I ■)— f- /-I (^iTa 
^^t,^^ui^^^ycn^ 


cid*.— duv— v^^^i— y 


y^y *-y"-*-*-yyy 


(.^dL-iw-yyyy dy 


600 




aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 




gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 




ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 




cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 




ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 




ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 




cttaacgcct 


tggccgccgc 


ctgggaggga 


agggtttacc 


gggcggagga 


tcccttggag 


1020 




gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 




ctgagggaag 


ggattgccct 


ggcaccgggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 




gatccttcca 


acaccgcccc 


cgaaggggta 


gcccggcgct 


acggggggga 


gtggaccgag 


1200 




gaggcggggg 


aaagggcgct 


gctttccgaa 


aggctttacg 


ccgccctcct 


ggagcggctt 


1260 
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aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 132 0 
ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 13 80 
ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 
cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 
cccgccatcg gcaagacgga gaagacgggc aagcgctcca ccagcgccgc cgttttggag 
gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 
ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 
cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 
ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 
gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 1860 
cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 
cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 
gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 
gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 
cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 
gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 
cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 
atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 
ttgcaggtgc acgacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 
caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 
gtgggcatcg gggaggactg gctttccgcc aaggcctag 



<210> 254 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 254 



1440 
1500 
1560 
1620 
1680 
1740 
1800 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2499 



Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
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20 



25 



30 



Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 



Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
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325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
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625 



630 



635 



640 



Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser lie Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 255 
<211> 20 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 255 

cgatctcctc ggccacctcc 



<210> 256 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 256 

ggcggtgccc tggacgggca 



20 



<210> 257 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 257 

ccagctcgtt gtggacctga 



<210> 258 

<211> 2505 

<212> DNA 

<213> Thermus aquaticus 



<400> 258 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



20 



60 
120 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 78 0 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggg ggccgggtcc accgggcccc cgagccttat 



660 
720 



1020 
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aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt ccaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 12 60 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc acatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 144 0 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 222 0 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 24 60 

gaggtgggga taggggagga ctggctctcc gccaaggagt gatag 25 0 5 

<210> 259 
<211> 833 
<212> PRT 

<213> Thermus aquaticus 
<400> 259 
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Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 
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Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 
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Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu 



<210> 260 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 260 

caggaggagc tcgttgtgga cctgga 
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<210> 261 

<211> 836 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 261 

Met Asn Ser Glu 
1 

Leu Leu Val Asp 
20 

Lys Gly Leu Thr 
35 

Phe Ala Lys Ser 
50 

Val Phe Val Val 
65 

Tyr Glu Ala Tyr 



Arg Gin Leu Ala 
100 

Arg Leu Glu Val 
115 

Ala Lys Lys Ala 
130 

Asp Arg Asp Leu 
145 

Pro Glu Gly His 



Leu Arg Pro Glu 
180 

Ser Asp Asn Leu 
195 

Lys Leu Leu Lys 
210 

Asp Arg Val Lys 
225 

Glu Asp Leu Arg 



Ala Met Leu Pro 
5 

Gly His His Leu 



Thr Ser Arg Gly 
40 

Leu Leu Lys Ala 
55 

Phe Asp Ala Lys 
70 

Lys Ala Gly Arg 
85 

Leu lie Lys Glu 



Pro Gly Tyr Glu 
120 

Glu Lys Glu Gly 
135 

Tyr Gin Leu Val 
150 

Leu lie Thr Pro 
165 

Gin Trp Val Asp 



Pro Gly Val Lys 
200 

Glu Trp Gly Ser 
215 

Pro Glu Asn Val 
230 

Leu Ser Leu Glu 



Leu Phe Glu Pro 
10 

Ala Tyr Arg Thr 
25 

Glu Pro Val Gin 



Leu Lys Glu Asp 
60 

Ala Pro Ser Phe 
75 

Ala Pro Thr Pro 
90 

Leu Val Asp Leu 
105 

Ala Asp Asp Val 



Tyr Glu Val Arg 
140 

Ser Asp Arg Val 
155 

Glu Trp Leu Trp 
170 

Phe Arg Ala Leu 
185 

Gly He Gly Glu 



Leu Glu Asn Leu 
220 

Arg Glu Lys He 
235 

Leu Ser Arg Val 



Lys Gly Arg Val 
15 

Phe Phe Ala Leu 
30 

Ala Val Tyr Gly 
45 

Gly Tyr Lys Ala 



Arg His Glu Ala 
80 

Glu Asp Phe Pro 
95 

Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 

He Leu Thr Ala 



Ala Val Leu His 
160 

Glu Lys Tyr Gly 
175 

Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 

Leu Lys Asn Leu 



Lys Ala His Leu 
240 

Arg Thr Asp Leu 
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245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
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545 550 



555 560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 262 



670 



<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<^AO0'> 262 
atgaattccg 


aggcgatgct 


tccgctcttt gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


^ggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg gccgggtgca 


ccgggcagca 


1020 


gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


accggggtac ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc tggaaagggt 


gctctttgac 


1500 



671 



gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 198 0 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511 



<210> 263 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 263 

tctagaggat ctatcagtgg tggtggtggt ggtgctcctt ggcggagagc 50 



<210> 264 

<211> 58 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 264 
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tgcctgcagg tcgacgctag ctagtggtgg tggtggtggt gacccttggc ggaaagcc 



58 



<210> 265 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 265 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 



180 
240 
300 



600 
660 
720 



840 
900 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggg ggccgggtcc accgggcccc cgagccttat 102 0 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 
ctggaccctt ccaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 
gtcctggccc acatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 13 80 



1080 
1140 
1200 
1260 
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tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 162 0 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 174 0 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 192 0 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 234 0 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 



<210> 266 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 266 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 



2517 
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35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
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340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
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645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 267 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 267 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 
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ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 
atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 
gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 
ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 
gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 210 0 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccac 2526 



<210> 268 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 268 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 
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Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 



Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 



Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


269 


<211> 


30 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


269 



gccgccaggg gcggccgcgt ccaccgggcc 



<210> 


270 


<211> 


31 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


270 



gcctgcaggg gcggccgcgt gcaccggggc a 



<210> 


271 


<211> 


26 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


271 



ctcctggacc cttcgaacac cacccc 



<210> 272 
<211> 23 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 272 

gtcctggccc atatggaggc cac 23 



<210> 273 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 273 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 78 0 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 90 0 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 0 0 
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gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


13 8 0 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


150 0 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 




gagctcacca 


agctcaagaa cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


T £r O Pi 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


174 0 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


192 0 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


22 80 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 274 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 274 
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Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 ^5 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 
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Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 



Met Trp Ala Glu Leu 
325 

His Arg Ala Ala Asp 
340 

Arg Gly Leu Leu Ala 
355 

Leu Asp Leu Val Pro 
370 

Asp Pro Ser Asn Thr 
385 

Glu Trp Thr Glu Asp 
405 

His Arg Asn Leu Leu 
420 

Leu Tyr His Glu Val 
435 

Glu Ala Thr Gly Val 
450 

Leu Glu Leu Ala Glu 
465 

Leu Ala Gly His Pro 
485 

Val Leu Phe Asp Glu 
500 

Thr Gly Lys Arg Ser 
515 

Ala His Pro He Val 
530 

Leu Lys Asn Thr Tyr 
545 

Thr Gly Arg Leu His 
565 

Arg Leu Ser Ser Ser 
580 

Pro Leu Gly Gin Arg 
595 



Lys Ala Leu Ala Ala Cys 
330 

Pro Leu Ala Gly Leu Lys 
345 

Lys Asp Leu Ala Val Leu 
360 

Gly Asp Asp Pro Met Leu 
375 

Thr Pro Glu Gly Val Ala 
390 395 

Ala Ala His Arg Ala Leu 
410 

Lys Arg Leu Glu Gly Glu 
425 

Glu Lys Pro Leu Ser Arg 
440 

Arg Arg Asp Val Ala Tyr 
455 

Glu He Arg Arg Leu Glu 
470 475 

Phe Asn Leu Asn Ser Arg 
490 

Leu Arg Leu Pro Ala Leu 
505 

Thr Ser Ala Ala Val Leu 
520 

Glu Lys He Leu Gin His 
535 

Val Asp Pro Leu Pro Ser 
550 555 

Thr Arg Phe Asn Gin Thr 
570 

Asp Pro Asn Leu Gin Asn 
585 

He Arg Arg Ala Phe Val 
600 



Arg Gly Gly Arg Val 
335 

Asp Leu Lys Glu Val 
350 

Ala Ser Arg Glu Gly 
365 

Leu Ala Tyr Leu Leu 
380 

Arg Arg Tyr Gly Gly 
400 

Leu Ser Glu Arg Leu 
415 

Glu Lys Leu Leu Trp 
430 

Val Leu Ala His Met 
445 

Leu Gin Ala Leu Ser 
460 

Glu Glu Val Phe Arg 
480 

Asp Gin Leu Glu Arg 
495 

Gly Lys Thr Gin Lys 
510 

Glu Ala Leu Arg Glu 
525 

Arg Glu Leu Thr Lys 
540 

Leu Val His Pro Arg 
560 

Ala Thr Ala Thr Gly 
575 

He Pro Val Arg Thr 
590 

Ala Glu Ala Gly Trp 
605 
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Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 



His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 275 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 275 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
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caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 
cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 



12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 



<210> 276 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 276 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2517 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 
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Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 



Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



691 



Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

His Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 



Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



Asp Leu Glu Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Glu His His His His His His 
835 



<210> 277 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 277 

gaggaggcgg ggcaccgggc cgccctt 



<210> 278 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 278 

ctttccgaga ggctccatcg gaacctgtgg gggagg 



<210> 279 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 279 
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ctcttcgcca acctgcttaa gaggcttgag ggggag 



36 



<210> 280 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 280 

aggccccttt cccgggtcct ggcccat 



<210> 281 

<211> 27 

<212> DHA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 281 

acgggggtgc gccgggacgt ggcctat 



<210> 282 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 282 

gtggcctatc tccaggcctt gtccctg 



<210> 283 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 283 

ttgtccctgg agcttgccga ggagatc 



<210> 284 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 



<400> 284 

gccgaggaga tccgccgcct cgaggcc 



<210> 285 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 285 

gcccgcctcg aggaggaggt cttccgc 



<210> 286 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 286 

tttgacgagc taaggcttcc cgccatc 



<210> 287 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 287 

atcgccaaga cgcaaaagac cggcaag 



<210> 288 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 288 

aagatcctgc agcaccggga gctcacc 



<210> 289 
<211> 27 
<212> DNA 



<213> Artificial 



<220> 

<223> Synthetic 
<400> 289 

accaagctga agaacaccta cattgac 



<210> 290 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 290 

aagagcacct acgtggaccc cttgccg 



<210> 291 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 291 

attgacccct tgccgagcct cgtccacccc aggacgggc 



<210> 292 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 292 

ttcgccaacc tgcttgggag gcttgagggg gag 



<210> 293 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 293 

ctcttcgcca acctgtggaa gaggcttgag ggg 
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<210> 294 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 294 

gcttgcggtc tgggtggcga tgtccttccc etc 33 



<210> 295 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 295 

catgttgaag gccatggcct ccgcggcctc cct 3 3 



<210> 296 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 296 

caggaggagc tcgttggcga cctggaggag 3 0 



<210> 297 

<211> 45 

<212^ DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 297 

ggagcgcttg cctgtcttct tcgtcttctt caaggcggga ggcct 4 5 



<210> 298 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 298 
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gaggaccagc tcgttggcga 



<210> 299 

<211> 33 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 299 

gaggggcggg acatcgccac 



<210> 300 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 300 

cagaacatcc ccgtcgccac 



<210> 301 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 301 
gggcttcccg ccatcaagaa 



<210> 302 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 302 
ctagggcttc ccgccatcaa 



<210> 303 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 



cctgaaggag cat 



ggagaccgcc age 



cccgcttggg cag 



gacggagaag acc 



gaagacgcaa aagaccggc 
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<223> Synthetic 



<400> 303 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 



<210> 304 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 304 

cgggacctcg aggcgcgtga accccaggag gtccac 



<210> 305 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 305 
ctcctccacg agttcggc 



<210> 306 

<211> 48 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 306 

accggtcttc ttcgtcttct tcaacttggg aagcctgagc tcgtcaaa 



<210> 


307 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


307 



aagacgaaga agaccggtaa gcgctccacc age 



<210> 308 
<211> 52 
<212> DNA 
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<213> Artificial 



<220> 

<223> Synthetic 
<400> 308 

gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 52 



<210> 309 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (19) . . (42) 

<223> The bases in these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides 



<400> 309 

ggagcgctta ccggtctttt gcgtcttctt gatcttggga agccttagct cgtcaaagag 60 



<210> 310 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 310 

ctcctccacg agttcggc IB 



<210> 311 

<211> 60 

<212> DN-A 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (19) . . (42) 

<223> The bases at these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides. 



<400> 311 

caaaagaccg gtaagcgctc caccagcgcc gccgtcctgg aggccctccg cgaggcccac 60 
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<210> 312 

<211> 52 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 312 

gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 



<210> 313 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 313 

gtcggagggg tcccccacga g 



<210> 314 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 314 
tgtggaattg tgagcgg 



<210> 315 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (28) . . (59) 

<223> The bases in these positions within tH primer are 91% of the 
shown and 3% each of the other 3 nucleotides. 



<400> 315 

ctcgtggggg acccctccga caacctcccc ggggtcaagg gcatcgggga gaagaccgcc 
ctcaagcttc tcaag 
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<210> 316 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 316 

gtggcctcca tatgggccag gac 



<210> 317 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 317 

cgggacctcg aggcgcgtga accccaggag gtccac 



<210> 318 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 318 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 



<210> 319 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 319 

gtcggactcg tcaccggtca gggc 



<210> 320 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<220> 

<221> modif ied_base 

<222> (28).. (60) ^ -u u 

<223> The bases in these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides. 



<400> 320 

ctgaccggtg acgagtccga caaccttccc ggggtcaagg gcatcgggga gaggacggcg 60 

75 

aggaagcttc tggag 



<210> 321 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 321 

tagctcctgg gagagggcgt gggccgacat gcc 



<210> 322 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 322 

cttccagaac ctctttaaac ggctttccga gaag 



<210> 323 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 323 

cttctcggaa agccgtttaa agaggttctg gaag 



<210> 324 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 324 

ccggtgggcc ggacgcagaa gacgggcaag c 



<210> 325 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 325 

gcttgcccgt cttctgcgtc cggcccaccg g 



<210> 326 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 326 

ctcctccaag tgcacaacga gctggtcctg g 



<210> 327 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 327 

ccaggaccag ctcgttgtgc acttggagga g 



<210> 328 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 328 

gccgccctcc tgaagcggct taaggg 



<210> 329 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 329 

cccttaagcc gcttcaggag ggcggc 



<210> 330 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 330 

atcggcaaga cgcagaagac gggcaagc 



<210> 331 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 331 

gcttgcccgt cttctgcgtc ttgccgat 



<210> 332 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 332 

ttgcaggtgc acaacgaact ggtcctc 



<210> 333 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 333 

gaggaccagt tcgttgtgca cctgcaa 



<210> 334 



<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 334 

cagaccatga attccacccc actttttgac ctggag 3 6 



<210> 335 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 335 

gtggacgcgg ccgcccgagg ccgccgccag ggccag 36 



<210> 336 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 336 

cagaccatga attccctgcc cctctttgag cccaag 36 



<210> 337 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 337 

gtaaaccgcg ccgccccagg cggcggccaa ggcgtt 36 



<210> 338 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 338 

gaggtggagc ggcccctctc ccgggtcttg 30 
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<210> 339 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 339 

caagacccgg gagaggggcc gctccacctc 



<210> 340 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 340 

atggaattca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 



30 



60 
120 
180 
240 



360 
420 



ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 
aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 
gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 
acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 
gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 54 0 
gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 
ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 
aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 72 0 
ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 
aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 
gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 
gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 
atgtgggcgg agcttctggc cctggcggcg gcctcggagg gccgggtcca ccgggcaaca 1020 



780 
840 
900 
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agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caaggacctg 


1080 


gccgttttgg 


ccctgcggga 
— f — f —J — / 


qgqggtggcc 

ZD ZJ Zi Z> Z? 


ctggacccca 


cggacgaccc 


cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacgcagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 

ZJZjZiZ) 


agcgcatccg 


caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga 


gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg 


ctacttccag 


2100 


agcttcccca 


aqqtqcqqqc 


ctggatagaa 


aqqaccctqq 

-3—) ZiZi 


aqqaqqqccq 


gacgcggggc 


2160 


tacqtqgaqa 


CCCtqttCqq 


caqqaqqcqc 


tatgtgcccg 


acctggcctc 


ccgggtccgc 


2220 


tcggtgcggg 


agqcqqcqqa 


qcqqatqqcc 


ttcaacatgc 


ccqtqcaqqq 

3 ^ ZJ J ZJ Z) 


caccgccgcc 


2280 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct 


gggggcccac 


2340 


ctcctcctcc 


aagtgcacaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg 


ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt 


gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 




2505 



<210> 341 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 



<400> 341 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 
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Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 342 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 342 

tggcggcggc ctcgggcggc cgcgtccacc gggcaaca 



38 



<210> 343 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 343 

cttctctcat ccgccaaaac agcc 



<210> 344 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 344 

tggccgccgc ctggggcggc cgcgtttacc gggcggag 



<210> 345 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 345 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 
aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 
gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 
acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 
gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 
gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 



24 



38 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
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ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 72 0 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 84 0 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtcca ccgggcaaca 1020 

agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 1080 

gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 114 0 

gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 1200 

gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 12 60 

tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc 132 0 

cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 13 80 

ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 144 0 

ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 1500 

ggcctcaccc cggtgggccg gacgcagaag acgggcaagc gctccaccgc ccagggggcc 1560 

ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 162 0 

tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 168 0 

cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 174 0 

cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 1800 

gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 1860 

ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 192 0 

cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 1980 

cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 2 04 0 

tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 2100 

agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 2160 

tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 222 0 

tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 22 80 

gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 2340 
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ctcctcctcc aagtgcacaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 
gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 
gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 



2400 
2460 
2505 



<210> 346 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 346 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Net Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 
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Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

I^et Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 
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Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 
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Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 



Lys Gin Asp 
835 

<210> 347 
<211> 2496 
<212> DMA 
<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 347 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 24 0 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 

atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 3 60 

ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 42 0 

gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 480 

ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 54 0 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 72 0 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 

cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 84 0 

ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 900 

ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 

cttaacgcct tggccgccgc ctggggcggc cgcgtttacc gggcggagga tcccttggag 102 0 

gccttgcggg ggcttgggga ggtgaggggg cttttggcca aggacctggc ggtgctggcc 108 0 

ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 114 0 

gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 1200 

gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct gaagcggctt 1260 
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^^99999^99 agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 1320 

ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 13 80 

ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 1440 

cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 

cccgccatcg gcaagacgca gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 1620 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 16 80 

cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 1740 

ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 1800 

gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 1860 

cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 1920 

cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 

gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2040 

gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 

cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 

gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 222 0 

cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 22 8 0 

atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 

ttgcaggtgc acaacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 2400 

caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 2460 

gtgggcatcg gggaggactg gctttccgcc aaggcc 24 96 

<210> 348 
<211> 832 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 348 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 



Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
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20 



25 



30 



Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
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325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
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625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser lie Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 349 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 349 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 24 0 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 
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aagcggctgg 


tggaccttct 


999ctttacc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


aacQtcctQQ 


gcaccctggc 


caagaaggcc 


qaaaqqqaqq 


ggatggaggt 


gcgcatcctc 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


9ga9C9ctgg 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


99^99^9ctg 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


gccccctggc 


ccccgcccga 


^99g9<^cttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca 


ccgggcaaca 


aqcccqqttq 


aqqccctqqc 


ggacctcaag 


qaqqtccgqg 


gcctcctcgc 


caaggacctc 


qccqtcttqq 


CCtcqaqqqa 


qqqqctaqac 


ctcgtgcccg 


999acgaccc 


catgctcctc 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccqagqggq 


tggcgcggcg 


ctacgggggg 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


cttaagcgcc 


tcgaggggga 


qqaqaaqctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


ctctcccggg 


tcctggccca 


tatqqaqqcc 

^ ^ ^ ^ ^ ^ 


accqqgqtac 


ggcgggacgt 


ggcctacctt 


caggcccttt 


ccctggagct 


tqcqqaqqaq 


atccgccgcc 


tcqaqqaqqa 


ggtcttccgc 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tqqaaaqqqt 


gctctttgac 


QaqcttaQQC 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


qcqqtqctqq 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


acqqqccqcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacqqqqaq 


gcttagtagc 


t ccgacccca 


acc tgcagaa 


catccccgtc 


cgcaccccct 




gatccgccgg 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


99^9999^^9 


gacatccaca 


ccca.q3iccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


ctgatgcgcc 


9g9c:99ccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2 34 0 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 24 00 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccac 2526 

<210> 350 
<211> 2505 
<212> DUA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 350 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 6 0 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 

aagcggctgg tggaccttct gggctttacc cgcctcgagg ccccggggta cgaggcggac 360 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 420 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 60 0 

gggg^gaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 72 0 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 84 0 
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gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 
gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 
atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtcca ccgggcaaca 
agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 
gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 
gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 
gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 
tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc 
cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 
ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 
ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 
ggcctcaccc cggtgggccg gacgcagaag acgggcaagc gctccaccgc ccagggggcc 
ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 
tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 
cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 
cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 
gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 
ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 
cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 
cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 
tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 
agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 
tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 
tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 
gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 
ctcctcctcc aagtgcacaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 
gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 
gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 2505 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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<210> 351 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 351 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 
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Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 
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Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 352 
<211> 2496 
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<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 352 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 
atggtggacc ttttgggctt tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 
ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 
taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 
aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 
gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 
ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 
cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 
ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 
ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 
cttaacgcct tggccgccgc ctggggcggc cgcgtttacc gggcggagga tcccttggag 
gccttgcggg ggcttgggga ggtgaggggg cttttggcca aggacctggc ggtgctggcc 10 80 
ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 
gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 
gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct gaagcggctt 
aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 132 0 
ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 13 80 
ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 1440 
cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 



60 
120 
180 
240 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1140 
1200 
1260 



1500 
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cccgccatcg 


gcaagacgca 


gaagacgggc 


aagcgctcca 


ccagcgccgc 


cgttttggag 


1560 




gccttgcggg 


aggctcatcc 


catcgtggac 


cgcatccttc 


agtaccggga 


gctttccaag 


1620 




ctcaagggaa 


cctacatcga 


tcccttgcct 


gccctggtcc 


accccaagac 


gaaccgcctc 


1680 




cacacccgtt 


tcaaccagac 


ggccaccgcc 


acggggaggc 


ttagcagctc 


ggatcctaat 


1740 




ctgcaaaata 


tccccgtgcg 


cacccctttg 


ggccagcgga 


tccgccgggc 


cttcgtggcc 


1800 




gaggaggggt 


ggaggctggt 


ggttttggac 


tacagccaga 


ttgagctcag 


ggtcctggcg 


1860 




cacctttccg 


gggacgagaa 


cctaatccgg 


gtcttccagg 


agggccagga 


catccacacc 


1920 




cagacggcca 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgcgccgg 


1980 




gcggccaaga 


ccatcaactt 


cggcgtcctc 


tacggcatgt 


ccgcccaccg 


gctttcggga 


2040 




gagctggcca 


tcccctacga 


ggaggcggtg 


gccttcatcg 


agcggtattt 


ccagagctac 


2100 




cccaaggtgc 


gggcctggat 


tgagaaaacc 


ctggcggaag 


gacgggaacg 


gggctatgtg 


2160 


if™ 

■i^ " 


gaaaccctct 


ttggccgccg 


gcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


2220 




cgggaggcag 


cggagcgcat 


ggccttcaac 


atgccggtcc 


aggggaccgc 


cgcggatttg 


2280 




atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgctt 


o T A n 

Z J -4: u 




ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg 


agcaagcgga 


ggaagtcgcc 


2400 




caggaggcca 


agcggaccat 


ggaggaggtg 


tggcccctga 


aggtgccctt 


ggaggtggaa 


2460 




gtgggcatcg 


999^9g^ctg 


gctttccgcc 


aaggcc 
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<210> 353 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 353 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 
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Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 



Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 



Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu Ala 
355 360 365 
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Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 



Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Asp Arg lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 



730 



Met Ser Ala His 
675 

Ala Val Ala Phe 
690 

Ala Trp lie Glu 
705 

Glu Thr Leu Phe 



Val Lys Ser lie 
740 

Val Gin Gly Thr 
755 

Phe Pro Arg Leu 
770 

Asn Glu Leu Val 
785 

Gin Glu Ala Lys 



Leu Glu Val Glu 
820 



Arg Leu Ser Gly 
680 

lie Glu Arg Tyr 
695 

Lys Thr Leu Ala 
710 

Gly Arg Arg Arg 
725 

Arg Glu Ala Ala 



Ala Ala Asp Leu 
760 

Gin Glu Leu Gly 
775 

Leu Glu Ala Pro 
790 

Arg Thr Met Glu 
805 

Val Gly lie Gly 



Glu Leu Ala lie 



Phe Gin Ser Tyr 
700 

Glu Gly Arg Glu 
715 

Tyr Val Pro Asp 
730 

Glu Arg Met Ala 
745 

Met Lys Leu Ala 



Ala Arg Met Leu 
780 

Lys Glu Gin Ala 
795 

Glu Val Trp Pro 
810 

Glu Asp Trp Leu 
825 



Pro Tyr Glu Glu 
685 

Pro Lys Val Arg 



Arg Gly Tyr Val 
720 

Leu Ala Ser Arg 
735 

Phe Asn Met Pro 
750 

Met Val Lys Leu 
765 

Leu Gin Val His 



Glu Glu Val Ala 
800 

Leu Lys Val Pro 
815 

Ser Ala Lys Ala 
830 



<210> 354 

<211> 42 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 354 

ggcctcaccc cggtgaagcg gacgaagaag acgggcaagc gc 



<210> 


355 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


355 



gcgcttgccc gtcttcttcg tccgcttcac cggggtgagg cc 



<210> 356 
<211> 33 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 356 

ctcctcctcc aagtggccaa cgagctggtc ctg 33 



<210> 357 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 357 

caggaccagc tcgttggcca cttggaggag gag 3 3 



<210> 358 
<211> 2505 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 358 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 12 0 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 360 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 42 0 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 

gggg^g^aga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 720 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 840 
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gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

^tgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtcca ccgggcaaca 1020 

agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 1080 

gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 1140 

gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 12 00 

gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 12 60 

tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc 1320 

cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 13 80 

ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 1440 

ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 1500 

ggcctcaccc cggtgaagcg gacgaagaag acgggcaagc gctccaccgc ccagggggcc 1560 

ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 1620 

tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 168 0 

cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 1740 

cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 1800 

gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 1860 

ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 192 0 

cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 1980 

cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 2040 

tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 2100 

agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 2160 

tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 222 0 

tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 2 2 80 

gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 2340 

ctcctcctcc aagtggccaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 24 00 

gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 

gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 2 505 
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<210> 359 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 359 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
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245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
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545 



550 



555 



560 



Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly lie Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys lie Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 360 
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<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 360 

gggcttcccg ccatcaagaa gacgaagaag acgggcaagc go 



<210> 361 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 361 

gcgcttgccc gtcttcttcg tcttcttgat ggcgggaagc cc 



<210> 362 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 362 

atgcttttgc aggtggccaa cgaactggtc etc 



<210> 363 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 363 

gaggaccagt tcgttggcca cctgcaaaag cat 



<210> 364 

<211> 2496 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 364 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggc 
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ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 18 0 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 3 00 

atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 

ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 42 0 

gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 48 0 

ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 54 0 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 60 0 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 72 0 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 

cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 84 0 

ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 900 

ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 

cttaacgcct tggccgccgc ctggggcggc cgcgtttacc gggcggagga tcccttggag 102 0 

gccttgcggg ggcttgggga ggtgaggggg cttttggcca aggacctggc ggtgctggcc 1080 

ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 1140 

gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 1200 

gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct gaagcggctt 1260 

aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 1320 

ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 13 80 

ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 144 0 

cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 

cccgccatca agaagacgag gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 162 0 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 

cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 174 0 
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ctgcaaaata 


tccccgtgcg 


cacccctttg 


ggccagcgga 


tccgccgggc 


cttcgtggcc 


1800 


gaggaggggt 


ggaggctggt 


ggttttggac 


tacagccaga 


ttgagctcag 


ggtcctggcg 


1860 


cacctttccg 


gggacgagaa 


cctaatccgg 


gtcttccagg 


agggccagga 


catccacacc 


1920 


cagacggcca 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgcgccgg 


1980 


gcggccaaga 


ccatcaactt 


cggcgtcctc 


tacggcatgt 


ccgcccaccg 


gctttcggga 


2040 


gagctggcca 


tcccctacga 


ggaggcggtg 


gccttcatcg 


agcggtattt 


ccagagctac 


2100 


cccaaggtgc 


gggcctggat 


tgagaaaacc 


ctggcggaag 


gacgggaacg 


gggctatgtg 


2160 


gaaaccctct 


ttggccgccg 


gcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


2220 


cgggaggcag 


cggagcgcat 


ggccttcaac 


atgccggtcc 


aggggaccgc 


cgcggatttg 


2280 


atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgctt 


2340 


ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg agcaagcgga ggaagtcgcc 


2400 


caggaggcca 


agcggaccat 


ggaggaggtg 


tggcccctga 


aggtgccctt 


ggaggtggaa 


2460 


gtgggcatcg gggaggactg gctttccgcc 


aaggcc 






2496 



<210> 365 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 365 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 
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Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 



740 



Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 
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Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 366 
<211> 2505 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 366 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 



ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 
aagcggctgg tggaccttct gggctttacc cgcctcgagg ccccggggta cgaggcggac 
gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 
acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 
gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 
gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 
ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 
aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 
ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 



60 
120 
180 
240 
300 
360 
420 
480 



720 
780 
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aaggccctgc 


gccgcaggac 


ccccgacctg gagggcctga gggccttttt ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca 


ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc 


cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


1500 


ggcctcaccc 


cggtgaagcg 


gacgaagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg gaaggctttc 


cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg 


caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga 


gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


1920 


cataccgaga 


ccgccgcctg gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg 


ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg 


gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc 


ccgggtccgc 


2220 


tcggtgcggg 


aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg 


caccgccgcc 


2280 


aarctcrataa 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct 


gggggcccac 


2340 


ctcctcctcc 


aagtggccaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg 


ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt 


gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 




2505 
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<210> 367 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 367 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 
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Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 



Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525. 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 
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Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 



Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly lie Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 



Lys Gin Asp 
835 
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<210> 368 

<211> 2496 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 368 





atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 




ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 




caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 




gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 




aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 




atggtggacc 


ttttgggctt 


tacccgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 




ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 




gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


m 


ctgatcaccc 


cgg^gtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 




taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 




aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


i 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 




ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 




cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 




ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 




ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 




cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtttacc 


gggcggagga 


tcccttggag 


1020 




gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 




ctgagggaag 


ggattgccct 


ggcaccgggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 




gatccttcca 


acaccgcccc 


cgaaggggta 


gcccggcgct 


acggggggga 


gtggaccgag 


1200 




gaggcggggg 


aaagggcgct 


gctttccgaa 


aggctttacg 


ccgccctcct 


gaagcggctt 


1260 




aagggggagg 


agaggcttct 


ttggctttac 


gaggaggtgg 


aaaagcccct 


ttcgcgggtc 


1320 




ctggcccaca 


tggaggccac 


gggggtacgg 


ttggatgtgg 


cctacttaaa 


ggccctttcc 


1380 




ctggaggtgg 


aggcggagat 


aaggcgcttc 


gaggaggagg 


tccaccgcct 


ggccgggcat 


1440 
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cctttcaacc 


tgaactcccg 


ggaccagctg 


gaaagggtca 


tctttgacga 


gcttgggctt 


cccgccatca 


agaagacgag 


gaagacgggc 


aagcgctcca 


ccagcgccgc 


cgttttggag 


gccttgcggg 


aggctcatcc 


catcgtggac 


cgcatccttc 


agtaccggga 


gctttccaag 


ctcaagggaa 


cctacatcga 


tcccttgcct 


gccctggtcc 


accccaagac 


gaaccgcctc 


cacacccgtt 


tcaaccagac 


ggccaccgcc 


^cggggaggc 


ttagcagctc 


ggatcctaat 


ctgcaaaata 


tccccgtgcg 


cacccctttg 


ggccagcgga 


tccgccgggc 


cttcgtggcc 


gaggaggggt 


ggaggctggt 


ggttttggac 


tacagccaga 


ttgagctcag 


ggtcctggcg 


cacctttccg 


gggacgagaa 


cctaatccgg 


gtcttccagg 


agggccagga 


catccacacc 


cagacggcca 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgcgccgg 


gcggccaaga 


ccatcaactt 


cggcgtcctc 


tacggcatgt 


ccgcccaccg 


gctttcggga 


gagctggcca 


tcccctacga 


gg^ggcggtg 


gccttcatcg 


agcggtattt 


ccagagctac 


cccaaggtgc 


gggcctggat 


tgagaaaacc 


ctggcggaag 


gacgggaacg 


gggctatgtg 


gaaaccctct 


ttggccgccg 


qcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


cgggaggcag 


cggagcgcat 


ggccttcaac 


atgccggtcc 


aggggaccgc 


cgcggatttg 


atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgctt 


ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg 


agcaagcgga 


ggaagtcgcc 


caggaggcca 


agcggaccat 


ggaggaggtg 


tggcccctga 


aggtgccctt 


ggaggtggaa 


gtgggcatcg 


gggaggactg 


gctttccgcc 


aaggcc 







<210> 369 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 369 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2496 
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50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 ,215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
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355 



360 



365 



Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
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660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 



<210> 


370 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


370 


ccctccgaca acctcgccgg 


<210> 


371 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


371 



ccctccgaca acctcaaggg ggtcaagggc ate 



<210> 372 
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<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 372 

9^g9ttgtcg gaggggtc 18 



<210> 373 

<211> 2526 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 373 



p t'ci?^ a i" t" rr'cr 


a.yy^^yca,cyv_L. 




y a.cn_ ov-fCtw-ciy 


crrT'crcfCf t~ c p t~ 
y v^v-y yy uv-,^- u 


cot* crcrt" crcra c 




■f- nt~if~' +*■ f^c^ 

L.yyL.v^Lcn— v-.y 




y coc uycicxyy 


y{_C-L.Ccti^Oci(^ 


yciyoi-yyyyiw- 


y dd-c^cy y Ly u 


^ggcggucua 




aay ay ccuuc 


f- /—I 'a a/~rrTr^/~'/~'t~ 

ucdayy i-cc u 


gaayyagyac 


gggcacaagg 


ccy L-C L. ucgu 


ggt ct ttgac 


gccaaggccc 


/~i 4— 4— 4— ✓"t 

CCL-Ccucccy 


ccacy ayy 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


act tcccccg 


gcagctcgcc 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctcgc 


cggggtcaag 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 




cctggaggag 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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gcctacctcc 


tggacccttc gaacaccacc 


cccgaggggg 


>^99'^9^9y^y 


^ Uduy yyyyy 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


dgagy u L.ccd 


I, ^y y cici^-* c- «— 


1260 


cttaagcgcc 


tcgaggggga ggagaagctc 


ctttggctct 


accacgaggt 


gg ddddy uuv^ 


1320 


ctctcccggg 






accggggtac 


99^999^^91- 


yy^L< Ud\_-^ L- 1— 


1380 


caggcccttt 


r^rr'f' ^ rr r" "h 




atccgccgcc 


ucgaggdyyd 


y y u> ^ L« ^ ^y ^ 


1440 


ttggcgggcc 


CH_. ^ \— ' l— L. 1—. 'w CL Cl 


cct caactcc 


cgggaccagc 


L.ggaaaygy u 


yoL.v_^i- u i_yca.v^ 


1500 


gagcttaggc 


t'iT'PPar'Ptt 


gaagaagacg 


aagaagacag 


gcddy cy o CO 


Vvdv^^^CT.y*^y^^^ 


1560 


gcggtgctgg 


dyy v-o^^ L.CH— y 


V-J Cl.^ ^ ^ ^ 


cccatcgtgg 


agaagat cc t 


Ov^dyod'wV^yy 


1620 


gagctcacca 


^ s 3 /"TO a 
cig O t- clciy ci d 


^ciwv^ Lci^y *— y 


gaccccctcc 


caagccucy l. 


3 rr^ irr^^ cici 
o o d\-- o oy txy y 


1680 


acgggccgcc 


UOt-.ctv-.ci^v^^y 


r' t" 1~ r a a c c a a 


acggccacgg 


ccacygygdy 


OT'I" t aafcaac 

y t_ O I3 


1740 


tccgacccca 


acc ugcciy del 


^^duov^iw^y i-^— 


cgcaccccct 


ugggccdgdy 


ydCooy^\--yy 


1800 


gccttcgtgg 


ccgaggcggg 


*-yyy ^ "-y 


gtggccctgg 


act at age ca 


y d L-dy dy o 1— 


1860 


cgcgtcctcg 




^yyyy "^^y c* t*. 


aacctgatca 


gggucuEcca 


gg^yyyy*^^y 


1920 


gacatccaca 


cccagaccgc 


ddyi— t-yyct i— y 


ttcggcgtcc 


ccccggaggc 


cguggacccL- 


1 Q R n 

X 0 u 


ctgatgcgcc 


gggcggccaa 


gacyy uy ddo 


ttcggcgtcc 


tctacggcat 


y L-\— v^yt— o*^dL. 


2040 


aggctctccc 


aggagct-T-gc 




gaggaggcgg 


tggcctttat 


dy dyoy*-- 1^0.^ 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


y y y y °-y y ^'^^y 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccyy 


222 0 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca acatgcccgt 


ccagggcaL-c- 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


0 T /t n 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 374 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 374 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Ala Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 ^ 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 375 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 375 
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atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg 


acaacctcaa 


gggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


qaqcttaqqc 


ttcccgcctt 


gaagaagacg 


aagaagacag gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 24 0 0 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 
caccac 



<210> 376 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 376 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 



2100 
2160 
2220 
2280 
2340 



2526 
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Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Lys Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 
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Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 



His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 
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Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


377 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


377 



gagaggctcc atcggaagaa gcttaagcgc ctcgag 



<210> 
<211> 
<212> 
<213> 



378 

18 

DNA 

Artificial 



<220> 

<223> Synthetic 

<400> 378 
ccgatggagc ctctccga 



<210> 379 

<211> 2526 

<212> DNA 

<213> Artificial' 
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<220> 

<223> Synthetic 
<400> 379 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaagaag 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 
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1680 
1740 
1800 



gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2 52 0 

2526 



caccac 



<210> 380 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 380 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 -L5 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 



2040 
2100 



2220 
2280 
2340 
2400 
2460 
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65 



70 



75 



80 



Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
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370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Lys Lys Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
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675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


381 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


381 


gtgcgcaccg acctcctcct 


<210> 


382 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


382 



gaggtccacc tccaggagga ggtcggtgcg cac 



<210> 383 



766 



<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 383 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctcct cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 
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gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 8 0 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 



<210> 384 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 384 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 
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Phe Ala Lys Ser 
50 

Val Phe Val Val 
65 

Tyr Glu Ala Tyr 



Arg Gin Leu Ala 
100 

Arg Leu Glu Val 
115 

Ala Lys Lys Ala 
130 

Asp Arg Asp Leu 
145 

Pro Glu Gly His 



Leu Arg Pro Glu 
180 

Ser Asp Asn Leu 
195 



Leu Leu Lys Ala 
55 

Phe Asp Ala Lys 
70 

Lys Ala Gly Arg 
85 

Leu lie Lys Glu 



Pro Gly Tyr Glu 
120 

Glu Lys Glu Gly 
135 

Tyr Gin Leu Val 
150 

Leu lie Thr Pro 
165 

Gin Trp Val Asp 



Pro Gly Val Lys 
200 



Leu Lys Glu Asp 
60 

Ala Pro Ser Phe 
75 

Ala Pro Thr Pro 
90 

Leu Val Asp Leu 
105 

Ala Asp Asp Val 



Tyr Glu Val Arg 
140 

Ser Asp Arg Val 
155 

Glu Trp Leu Trp 
170 

Phe Arg Ala Leu 
185 

Gly He Gly Glu 



Gly Tyr Lys Ala 



Arg His Glu Ala 
80 

Glu Asp Phe Pro 
95 

Leu Gly Phe Thr 
110 

Leu Ala Thr Leu 
125 

lie Leu Thr Ala 



Ala Val Leu His 
160 

Glu Lys Tyr Gly 
175 

Val Gly Asp Pro 
190 

Lys Thr Ala Leu 
205 



Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 



Glu Asp Leu Arg 



Leu Leu Glu Val 
260 

Leu Arg Ala Phe 
275 



Leu Ser Leu Glu 
245 

Asp Leu Ala Gin 



Leu Glu Arg Leu 
280 



Leu Ser Arg Val 
250 

Gly Arg Glu Pro 
265 

Glu Phe Gly Ser 



Arg Thr Asp Leu 
255 

Asp Arg Glu Gly 
270 

Leu Leu His Glu 
285 



Phe Gly Leu Leu Glu Ala Pro Ala 
290 295 

Pro Pro Glu Gly Ala Phe Val Gly 
305 310 

Met Trp Ala Glu Leu Lys Ala Leu 
325 

His Arg Ala Ala Asp Pro Leu Ala 
340 



Pro Leu Glu Glu Ala Pro Trp Pro 
300 

Phe Val Leu Ser Arg Pro Glu Pro 
315 320 

Ala Ala Cys Arg Gly Gly Arg Val 
330 335 

Gly Leu Lys Asp Leu Lys Glu Val 
345 350 
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Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 



770 



Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


385 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


385 



ggggccttcg tgggctacgt cctctcccgc ccc 



<210> 386 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 386 
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ggggcgggag aggacgtagc ccacgaaggc ccc 



33 



<210> 387 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 387 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggctacg tcctctcccg ccccgagccc 
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 8 0 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccc tggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 8 0 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 234 0 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccac 2526 



<210> 388 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 388 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 



Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 



Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 389 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 389 

gaaaacctcc tcaagcacct ggaccaggta aagccagaaa ac 



<210> 390 

<211> 42 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 390 



gttttctggc 


tttacctggt 


ccaggtgctt 


gaggaggttt 


tc 




<210> 391 
<211> 2526 
<212> DNA 
<213> Artificial 










<220> 

<223> Synthetic 










<400> 391 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


gg^tacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


^99^9999ta 


cgaggtgcgc 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cgg9gtcaag 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


ctcaagcacc 


tggaccaggt 


aaagccagaa 


aacqtccqqq 


agaagatcaa 


ggcccacctg 


gaagacctca 


ggctctcctt 


ggagctctcc 


cqqqtqcqca 


ccgacctccc 


CCtqqaqqtq 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggc 1 1 a 


gggccttcct 


qqaqaqqctq 


gagttcggca 


gcctcctcca 


C9^9ttcggc 


ctcctggagg 


cccccgcccc 


cctqaaqqaq 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


gaccccttgg 


C9g9gctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 



42 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 
gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 



caccac 



<210> 392 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 



2520 
2526 



<400> 392 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
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1 



5 



10 



15 



Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Asp Gin Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
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305 



310 



315 



320 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 



780 



610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 393 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 393 

gagctctccc gggtgcacac cgacctcccc ctg 33 



<210> 394 



781 



<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 394 

cagggggagg tcggtgtgca cccgggagag etc 33 

<210> 395 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 395 



atgaat tccg 


aggcgatgct 


tccgct cttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


^ ^ ^ \^ CL ^ 




V-^ C4. w i-' V- w w ^ 


^ w ^ ^ Zj 13 


gcc t caccac 


3 "-3 ^ 3 y 13 y ^ 


120 


/--T o o r^rtn i~ n 


cty y t_y y l. (_ u d 


yyL-ucL-yoi— 


ddydyo^ou.*^^ 


L-^ddy y u 


y ddy y dy y ct*w- 


180 


L-dtwdctyy 


u-yuL-(-ui^yu 


yy u L-uyd^^ 


y v_. K_ cL cty y 


/-I r-" -f- f- r^/^rr 


d v_. y dy y 


24 0 


L.cicyciyyL.CL, 


a 13 a 

dOdciyyv^yyy 


y dy y y \^ oy 


d v-. L. o y dy y 


dtw L. L.v^t^t—v—'wy 


yv_-dy^ UL-yt^L- 




ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


^99^9599^ a 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcaca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


C99ggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 
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gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 234 0 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2 52 0 

caccac 2526 

<210> 396 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 



783 



<400> 396 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
2 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 397 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 397 



785 



gagctctccc gggtgcacac cgacctcctc ctg 



33 



<210> 398 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 398 

caggaggagg tcggtgtgca cccgggagag etc 3 3 



<210> 399 

<211> 2526 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 399 



a y~ /~T ~i +— •}— /-I 

d-Ly ciau cccy 


d.y y ego. L.y c u 


4~ /"< /~f 4— z"' "H 

uccgcccL-cu 


y a. ci c c c a. d. o, y 


y u oy y y L- o u L, 


\^\^ uyy uyy Civ- 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcaca 


ccgacctcct 


cctggaggtg 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 8 0 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 168 0 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 18 60 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 24 00 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccac 2526 

<210> 400 
<211> 842 
<212> PRT 



788 



<213> Artificial 



<220> 

<223> Synthetic 
<400> 400 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 



Asp Arg Asp Leu Tyr Gin Leu Val 
145 150 

Pro Glu Gly His Leu lie Thr Pro 
165 

Leu Arg Pro Glu Gin Trp Val Asp 
180 



Ser Asp Arg Val Ala Val Leu His 
155 160 

Glu Trp Leu Trp Glu Lys Tyr Gly 
170 175 

Phe Arg Ala Leu Val Gly Asp Pro 
185 190 



Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 



Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 



Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 401 

<211> 2508 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 401 



atgaattccg 


aggcgatgct 


tccgctcttt gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 


12 0 


gaaccggtgc 


aggcggtcta 


cggcttcgcc aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


999tacaagg 


ccgtcttcgt 


ggtctttgac gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa aacgtccggg agaagatcaa ggcccacctg 


720 


gaagacctca ggctctcctt ggagctctcc cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga ggagaagctc gtttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 
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gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 168 0 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 180 0 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 24 0 0 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 60 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2 50 8 



<210> 402 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 402 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 
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Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 SO 95 



Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 



Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 
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Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 



Glu Trp Thr Glu Asp 
405 

His Arg Asn Leu Leu 
420 

Leu Tyr His Glu Val 
435 

Glu Ala Thr Gly Val 
450 

Leu Glu Leu Ala Glu 
465 

Leu Ala Gly His Pro 
485 

Val Leu Phe Asp Glu 
500 

Thr Gly Lys Arg Ser 
515 

Ala His Pro lie Val 
530 

Leu Lys Asn Thr Tyr 
545 

Thr Gly Arg Leu His 
565 

Arg Leu Ser Ser Ser 
580 

Pro Leu Gly Gin Arg 
595 

Ala Leu Val Ala Leu 
610 

His Leu Ser Gly Asp 
625 

Asp He His Thr Gin 
645 

Ala Val Asp Pro Leu 
660 

Val Leu Tyr Gly Met 
675 



Ala Ala His Arg Ala Leu 
410 

Lys Arg Leu Glu Gly Glu 
425 

Glu Lys Pro Leu Ser Arg 
440 

Arg Arg Asp Val Ala Tyr 
455 

Glu He Arg Arg Leu Glu 
470 475 

Phe Asn Leu Asn Ser Arg 
490 

Leu Arg Leu Pro Ala Leu 
505 

Thr Ser Ala Ala Val Leu 
520 

Glu Lys lie Leu Gin His 
535 

Val Asp Pro Leu Pro Ser 
550 555 

Thr Arg Phe Asn Gin Thr 
570 

Asp Pro Asn Leu Gin Asn 
585 

lie Arg Arg Ala Phe Val 
600 

Asp Tyr Ser Gin He Glu 
615 

Glu Asn Leu lie Arg Val 
630 635 

Thr Ala Ser Trp Met Phe 
650 

Met Arg Arg Ala Ala Lys 
665 

Ser Ala His Arg Leu Ser 
680 



Leu Ser Glu Arg Leu 
415 

Glu Lys Leu Val Trp 
430 

Val Leu Ala His Met 
445 

Leu Gin Ala Leu Ser 
460 

Glu Glu Val Phe Arg 
480 

Asp Gin Leu Glu Arg 
495 

Lys Lys Thr Lys Lys 
510 

Glu Ala Leu Arg Glu 
525 

Arg Glu Leu Thr Lys 
540 

Leu Val His Pro Arg 
560 

Ala Thr Ala Thr Gly 
575 

He Pro Val Arg Thr 
590 

Ala Glu Ala Gly Trp 
605 

Leu Arg Val Leu Ala 
620 

Phe Gin Glu Gly Lys 
640 

Gly Val Pro Pro Glu 
655 

Thr Val Asn Phe Gly 
670 

Gin Glu Leu Ala He 
685 
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Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 403 

<211> 66 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (19) . . (51) 

<223> The bases in these positions are a synthesis of 91% base shown an 
d 3% all other bases 



<400> 403 

ctccatcgga acctccttaa gcgcctcgag ggggaggaga agctcctttg gctctaccac 6 0 

gaggtg 66 



<210> 404 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthesis 

<400> 404 

aaggaggttc cgatggag 18 

<210> 405 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 405 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


^gctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 
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cttaagcgcc 




ggagaagc t c 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


yy ^yy y "-^y 


ggcctacctt 


1380 


CB.ggcccttt 


ccctggagct 


tQcqqaqqaq 


atccgccgcc 




ggtcttccgc 


1440 


ttqqccrqqcc 


accccttcaa 


cctcaactcc 


cqqaaccaqc 


tqqaaaqqqt 


gctctttgac 


1500 


craacfctaaac 


t tcccgcctt 


gaagaagacg 


c*. c*. d c< Vj 


apaaaPQPt" p 


C->1 W C*. ^-j ^ 


1560 


acaafcactaa 


aciar* c r 1" a r cr 


^ y ^ CL 


r'r'/-';a f- pcrf"crcr 

V— CI y I— yy 


a era a era 1" p p t* 


ppa crpa ppcicr 

w w Ct y O W w ^ 


1620 


ga.gctca.cc^ 


agctcaagaa 


cacct acgtg 


^ N-' ^ w V*^ W 


caaacct cat 


ccacccaaQQ 

W V- w ^ 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacacracjaa 


gc ttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


t gggc cagag 


gatccgccgg 


1800 


gccttcgtgg 


C CQaqaccTQQ 




cj tacr c c c t crcr 

y ^yy ^yy 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 




^ 


ocrcf t p 1 1" p pa 


craa oraacraa cr 

y y ciy yyy CLO-y 


1920 


gacatccaca 


cccagaccgc 


aagctgga tg 


1" t~ raapot" cp 

w w v_> ^ ^ Zj ^-^ Vi^ 


pcpcaaacrap 

v^y y o.y y 


cgtggacCCC 


1980 


ctgatgcgcc 




era CQCrt" crp5 3 p 


^ ^ Zj ZJ 


I— v — O Ca V_-. CL ^ 


crt ppcfpppat" 


2040 


^g9ctct:ccc 


aggagcttgc 


cat cccctac 


aacraaacrcacr 
y *-*-y y "-y y ^y y 


taacctttat 


aga^gcgc tac 


2100 


ttccaaagc t 


t ccccaaggt 




t" aaa a a a era 


p p p t Gfcra crcra 

^ ^ u-yyo-yyci 


CTQCfcraQQaaQ 
yyyy*^yy*-*-"y 


2160 


c a cr cr a c fc a c o 


tacraaac net" 




piCicjncinf' ^ prr 
ciy y v_,y t_ ci><-'y 


1~ nr'pr'f^s pr^l" 
L.yi^*rfV^yctv--^ L- 


p ?i ;^ pfTP p Pf^cr 


2220 


cf t" era a Q a CTPcr 






yy ci 


ct^cti^y^L^^^y L- 


oiw^ciyyy\— cLk^i^ 


22 80 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 




2508 



<210> 406 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 406 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
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20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
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325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Val Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
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625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 407 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 407 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
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tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 66 0 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaa^caccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgaa acgaggggaa ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

^cgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
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gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 234 0 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<210> 408 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 408 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 
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Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 



Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Asn Glu Gly Lys Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 



804 



Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 



Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 



Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 409 

<211> 2508 

<212> DNA 

<213> Artificial 



llJ <22 0> 



<22 3> Synthetic 
<400> 409 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 




atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 




gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 




gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 




gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 




g^gtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 




cttaagcgct 


tcgaggggga 


ggagaagctc 


ctttgcctct 


accacgaggt 


ggaaaagccc 


1320 




ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 




caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


iff 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 




gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 




gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


m 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 




acgggccgcc 


tccacacccg 


cttcaaccag 


a-cggccacgg 


ccacggggag 


gcttagtagc 


1740 




tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


4"- 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 




cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 




gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 




ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 




aggctctccc 


sggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 




ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 




cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 




gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 




gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 




gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 




ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 




2508 



<210> 410 
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<211> 836 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 410 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
3_ 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 
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Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 



Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Phe Glu Gly Glu Glu Lys Leu Leu Cys 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 



Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 
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Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 411 
<211> 2508 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 

<400> 411 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 


oTT'tacc tec 


fcggacccttc 


gaacaccacc 


cccgaggggg tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccttttt 


gaagaagacg 


aagaagacag gcaagcgctc 


caccagcgcc 


1560 
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gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<210> 412 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 412 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
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65 



70 



75 



80 



Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
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370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Phe Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 



814 



675 



680 



685 



Pro Tyr Glu Glu Ala Val Ala Phe lie Gin Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 413 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (19) . , (21) 

<223> n is any base a,t,c, or g. 



<400> 413 

caggagctta ggcttcccnn nttgaagaag acgaagaaga ca 



<210> 414 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<2 23> Synthetic 



815 



<400> 414 
cctaagctcg tcaaagag 



18 



<210> 415 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 415 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 24 0 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 



720 
780 



900 
960 
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ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 150 0 

gagcttaggc ttcccgtttt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 198 0 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 204 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 8 0 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 25 08 



<210> 416 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 416 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 



817 



Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Val Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 



819 



Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 VOO 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 VIO 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 

835 

<210> 417 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 417 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 



60 
120 
180 
240 



820 



tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gs-gtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccagttt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 15 60 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

a-cgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 



821 



ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 



<210> 418 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 418 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 



2220 
2280 
2340 
2400 
2460 
2508 
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Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 



Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ser Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 419 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 419 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


999tacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


g9993.agcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 
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gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag gcaagcgctc 


caccggtgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 




2508 



<210> 420 
<211> 836 



826 



<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 420 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
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260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Gly Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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565 



570 



575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 



His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 



610 



615 



620 
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<210> 
<211> 
<212> 
<213> 



Artificial 



421 

42 

DNA 
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<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (16) . . (18) 

<223> n is any base a,t,c, or g. 



<400> 421 

ggcaagcgct ccaccnnngc cgcggtgctg gaggccctac gg 



<210> 422 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 422 
ggtggagcgc ttgcc 

<210> 423 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 423 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
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cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 




gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 




gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 
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gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<210> 424 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 424 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro hys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 
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Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 



Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Leu Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 
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Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 



Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 
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Ser Ala Lys Gly 
835 

<210> 425 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (19) . , (21) 

<223> n is any base a,t,c, or g. 



<400> 425 

ggcaagcgct ccaccagcnn ngcggtgctg gaggccctac gg 42 



<210> 426 
<211> 2508 
<212> DHA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 426 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 
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gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 
ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagccgt 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 



2460 
2508 



836 



<210> 427 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 427 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 



837 



245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 

290 295 300 

Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Arg Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 



838 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 428 
<211> 2499 



839 



<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 428 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 60 0 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctca acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 10 80 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 12 00 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 132 0 

gtcctggccc atatggaggc cacgggggtg cgccgggacg tggcctatct cagggccttg 13 80 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 15 00 



840 



cttcccgcca 


tcaagaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg ggagctcacc 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


taqaqqcccg 


ggtgaagagc 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggag 







<210> 429 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 429 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2499 
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Val Vai Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 



Gly Tyr Lys Ala Gly 
85 

Leu Ala Leu lie Lys 
100 

Glu Val Pro Gly Tyr 
115 

Lys Ala Glu Lys Glu 
130 

Asp Leu Tyr Gin Leu 
145 

Gly Tyr Leu lie Thr 
165 

Pro Asp Gin Trp Ala 
180 

Asn Leu Pro Gly Val 
195 

Leu Glu Glu Trp Gly 
210 

Leu Lys Pro Ala lie 
225 

Lys Leu Ser Trp Asp 
245 

Val Asp Phe Ala Lys 
260 

Phe Leu Glu Arg Leu 
275 

Leu Glu Ser Pro Lys 
290 

Gly Ala Phe Val Gly 
305 

Asp Leu Leu Ala Leu 
325 

Pro Glu Pro Tyr Lys 
340 

Leu Ala Lys Asp Leu 
355 



Arg Ala Pro Thr Pro Glu 
90 

Glu Leu Val Asp Leu Leu 
105 

Glu Ala Asp Asp Val Leu 
120 

Gly Tyr Glu Val Arg lie 
135 

Leu Ser Asp Arg lie His 
150 155 

Pro Ala Trp Leu Trp Glu 
170 

Asp Tyr Arg Ala Leu Thr 
185 

Lys Gly lie Gly Glu Lys 
200 

Ser Leu Glu Ala Leu Leu 
215 

Arg Glu Lys lie Leu Ala 
230 235 

Leu Ala Lys Val Arg Thr 
250 

Arg Arg Glu Pro Asp Arg 
265 

Glu Phe Gly Ser Leu Leu 
280 

Ala Leu Glu Glu Ala Pro 
295 

Phe Val Leu Ser Arg Lys 
310 315 

Ala Ala Ala Arg Gly Gly 
330 

Ala Leu Arg Asp Leu Lys 
345 

Ser Val Leu Ala Leu Arg 
360 



Asp Phe Pro Arg Gin 
95 

Gly Leu Ala Arg Leu 
110 

Ala Ser Leu Ala Lys 
125 

Leu Thr Ala Asp Lys 
140 

Val Leu His Pro Glu 
160 

Lys Tyr Gly Leu Arg 
175 

Gly Asp Glu Ser Asp 
190 

Thr Ala Arg Lys Leu 
205 

Lys Asn Leu Asp Arg 
220 

His Met Asp Asp Leu 
240 

Asp Leu Pro Leu Glu 
255 

Glu Arg Leu Arg Ala 
270 

His Glu Phe Gly Leu 
285 

Trp Pro Pro Pro Glu 
300 

Glu Pro Met Trp Ala 
320 

Arg Val His Arg Ala 
335 

Glu Ala Arg Gly Leu 
350 

Glu Gly Leu Gly Leu 
365 



842 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 
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Gly Met Ser Ala 
675 

Glu Ala Gin Ala 
690 

Arg Ala Trp lie 
705 

Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Arg Leu Ala 



Pro Leu Glu Val 
820 

Glu 



His Arg Leu Ser 
680 

Phe lie Glu Arg 
695 

Glu Lys Thr Leu 
710 

Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Glu Glu Met 
775 

Val Leu Glu Ala 
790 

Lys Glu Val Met 
805 

Glu Val Gly lie 



Gin Glu Leu Ala 



Tyr Phe Gin Ser 
700 

Glu Glu Gly Arg 
715 

Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 

780 

Pro Lys Glu Arg 
795 

Glu Gly Val Tyr 
810 

Gly Glu Asp Trp 
825 



lie Pro Tyr Glu 
685 

Phe Pro Lys Val 



Arg Arg Gly Tyr 
720 

Asp Leu Glu Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Ala Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



<210> 430 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 430 

acgggggtgc gccgggacgt ggcctat 



<210> 


431 


<211> 


2508 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


431 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
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gaaccggtgc 


aggcggtcta 


cggcttcgcc 




gggtacaagg 


ccgtcttcgt 


ggtctttgac 




tacgaggcct 


acaaggcggg 


gagggccccg 




ctcatcaagg 


agctggtgga 


cctcctgggg 




gcggacgacg 


ttctcgccac 


cctggccaag 




atcctcaccg 


ccgaccgcga 


cctctaccaa 




cccgagggcc 


acctcatcac 


cccggagtgg 




cagtgggtgg 


acttccgcgc 


cctcgtgggg 




ggcatcgggg 


agaagaccgc 


cctcaagctc 




ctcaagaacc 


tggaccgggt 


aaagccagaa 


f"' 


gaagacctca 


ggctctcctt 


ggagctctcc 




gacctcgccc 


aggggcggga 


gcccgaccgg 




gagttcggca 


gcctcctcca 


cgagttcggc 




gccccctggc 


ccccgccgga 


aggggccttc 


nl 


atgtgggcgg 


agcttaaagc 


cctggccgcc 




gaccccttgg 


cggggctaaa 


ggacctcaag 


y. 


gccgtcttgg 


cctcgaggga 


ggggctagac 




gcctacctcc 


tggacccttc 


gaacaccacc 




gagtggacgg 


aggacgccgc 


ccaccgggcc 




cttaagcgcc 


tcgaggggga 


ggagaagctc 




ctctcccggg 


tcctggccca 


tatggaggcc 




caggcccttt 


ccctggagct 


tgcggaggag 




ttggcgggcc 


accccttcaa 


cctcaactcc 




gagcttaggc 


ttcccaagtt 


gaagaagacg 




gcggtgctgg 


aggccctacg 


g9a-99cccac 




gagctcacca 


agctcaagaa 


cacctacgtg 




acgggccgcc 


tccacacccg 


cttcaaccag 




tccgacccca 


acctgcagaa 


catccccgtc 



aagagcctcc tcaaggccct gaaggaggac 18 0 

gccaaggccc cctccttccg ccacgaggcc 240 

acccccgagg acttcccccg gcagctcgcc 300 

tttacccgcc tcgaggtccc cggctacgag 3 60 

aaggcggaaa aggaggggta cgaggtgcgc 420 

ctcgtctccg accgcgtcgc cgtcctccac 480 

ctttgggaga agtacggcct caggccggag 540 

gacccctccg acaacctccc cggggtcaag 600 

ctcaaggagt ggggaagcct ggaaaacctc 660 

aacgtccggg agaagatcaa ggcccacctg 72 0 

cgggtgcgca ccgacctccc cctggaggtg 780 

gaggggctta gggccttcct ggagaggctg 84 0 

ctcctggagg cccccgcccc cctggaggag 900 

gtgggcttcg tcctctcccg ccccgagccc 960 

tgcaggggcg gccgcgtgca ccgggcagca 102 0 

gaggtccggg gcctcctcgc caaggacctc 1080 

ctcgtgcccg gggacgaccc catgctcctc 1140 

cccgaggggg tggcgcggcg ctacgggggg 12 00 

ctcctctcgg agaggctcca tcggaacctc 1260 

ctttggctct accacgaggt ggaaaagccc 132 0 

accggggtac ggcgggacgt ggcctacctt 13 8 0 

atccgccgcc tcgaggagga ggtcttccgc 144 0 

cgggaccagc tggaaagggt gctctttgac 1500 

aagaagacag gcaagcgctc caccagcgcc 1560 

cccatcgtgg agaagatcct ccagcaccgg 162 0 

gaccccctcc caagcctcgt ccacccgagg 168 0 

acggccacgg ccacggggag gcttagtagc 174 0 

cgcaccccct tgggccagag gatccgccgg 1800 
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QCCttCQtQQ 


CCqaqqcqqq 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


aggctctccc 


aqaaqcttqc 


catcccctac 


qagqagqcgg 

3 Zf^ 


tggcctttat 


agagcgctac 


ttccaaagct 


tccccaaggt 


qcqqqcctqq 


atagaaaaga 


CCCtqqaqqa 


^ -mj J —if — / —J ^ 


CQQQQCtacq 


tggaaaccct 


cttcggaaga 


aqqcqctacq 


tgcccgacct 


caacgcccgg 


qtqaaqaqcq 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 





<210> 432 

<211> 836 

<212i' PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 432 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2508 
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115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 



847 



420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 



848 



725 



730 



735 



Leu Asn Ala Arg Val Lys Ser Val Arg 

740 745 

Phe Asn Met Pro Val Gin Gly Thr Ala 
755 760 

Met Val Lys Leu Phe Pro Arg Leu Arg 
770 775 

Leu Gin Val Ala Asn Glu Leu Leu Leu 
785 790 

Glu Glu Val Ala Ala Leu Ala Lys Glu 
805 

Leu Ala Val Pro Leu Glu Val Glu Val 
820 825 

Ser Ala Lys Gly 
835 

<210> 433 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 433 

cttaggcttc ccaagttgaa gaagacgaag aagaca 

<210> 434 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 434 

tgtcttcttc gtcttcttca acttgggaag cctaag 

<210> 435 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 435 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



Glu Ala Ala Glu Arg Met Ala 
750 

Ala Asp Leu Met Lys Leu Ala 
765 

Glu Met Gly Ala Arg Met Leu 
780 

Glu Ala Pro Gin Ala Arg Ala 
795 800 

Ala Met Glu Lys Ala Tyr Pro 
810 815 

Gly Met Gly Glu Asp Trp Leu 
830 



849 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 
gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 
cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 
ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 
caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 
gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



850 



tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatcgcca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 210 0 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 8 0 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gagg^ggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<210> 436 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 436 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 SO 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 



851 



Arg hen Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 



852 



His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 



Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He Ala Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 
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Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 



Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 437 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 437 

9^9999^^99 acatcgccac ccagaccgca age 3 3 



<210> 438 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 438 

gcttgcggtc tgggtggcga tgtccttccc etc 33 



<210> 439 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



854 



<400> 439 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 30 0 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 48 0 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 8 0 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 144 0 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagccg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 



855 



acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 




2508 



<210> 440 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 440 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 ' 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 



856 



Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 1B5 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 



Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 



857 



Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 



His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Pro Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 



858 



Pro Lys Val Arg 
705 

Arg Gly Tyr Val 



Leu Asn Ala Arg 
740 

Phe Asn Met Pro 
755 

Met Val Lys Leu 
770 

Leu Gin Val Ala 
785 

Glu Glu Val Ala 



Leu Ala Val Pro 
820 



Ala Trp He Glu 
710 

Glu Thr Leu Phe 
725 

Val Lys Ser Val 



Val Gin Gly Thr 
760 

Phe Pro Arg Leu 
775 

Asn Glu Leu Leu 
790 

Ala Leu Ala Lys 
805 

Leu Glu Val Glu 



Lys Thr Leu Glu 
715 

Gly Arg Arg Arg 
730 

Arg Glu Ala Ala 
745 

Ala Ala Asp Leu 



Arg Glu Met Gly 
780 

Leu Glu Ala Pro 
795 

Glu Ala Met Glu 
810 

Val Gly Met Gly 
825 



Glu Gly Arg Lys 
720 

Tyr Val Pro Asp 
735 

Glu Arg Met Ala 
750 

Met Lys Leu Ala 
765 

Ala Arg Met Leu 



Gin Ala Arg Ala 
800 

Lys Ala Tyr Pro 
815 

Glu Asp Trp Leu 
830 



Ser Ala Lys Gly 
835 



<210> 441 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 441 

cccgccttga agaagccgaa gaagacaggc aag 



<210> 442 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 442 

cttgcctgtc ttcttcggct tcttcaaggc ggg 



<210> 443 

<211> 2499 

<212> DNA 

<213> Artificial 



<220> 



859 



<223> Synthetic 
<400> 443 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 

saggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgttt accgggcgga ggatcccttg 1020 

gaggccttgc gggggcttgg ggaggtgagg gggcttttgg ccaaggacct ggcggtgctg 1080 

gccctgaggg aagggattgc cctggcaccg ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggatcctt ccaacaccgc ccccgaaggg gtagcccggc gctacggggg ggagtggacc 12 0 0 

gaggaggcgg gggaaagggc gctgctttcc gaaaggcttt acgccgccct cctgaagcgg 12 60 

cttaaggggg aggagaggct tctttggctt tacgaggagg tggaaaagcc cctttcgcgg 1320 

gtcctggccc acatggaggc cacgggggta cggttggatg tggcctactt aaaggccctt 13 80 

tccctggagg tggaggcgga gataaggcgc ttcgaggagg aggtccaccg cctggccggg 1440 

catcctttca acctgaactc ccgggaccag ctggaaaggg tcatctttga cgagcttggg 1500 

cttcccgcca tcaagaagac gaggaagacg ggcaagcgct ccaccagcgc cgccgttttg 1560 

gaggccttgc gggaggctca tcccatcgtg gaccgcatcc ttcagtaccg ggagctttcc 162 0 



860 



aagctcaagg gaacctacat cgatcccttg cctgccctgg tccaccccaa gacgaaccgc 1680 
ctccacaccc gtttcaacca gacggccacc gccacgggga ggcttagcag ctcggatcct 1740 
aatctgcaaa atatccccgt gcgcacccct ttgggccagc ggatccgccg ggccttcgtg 
gccgaggagg ggtggaggct ggtggttttg gactacagcc agattgagct cagggtcctg 
gcgcaccttt ccggggacga gaacctaatc cgggtcttcc aggagggcca ggacatccac 
acccagacgg ccagctggat gttcggcgtg cccccagagg ccgtggattc cctgatgcgc 
cgggcggcca agaccatcaa cttcggcgtc ctctacggca tgtccgccca ccggctttcg 
ggagagctgg ccatccccta cgaggaggcg gtggccttca tcgagcggta tttccagagc 
taccccaagg tgcgggcctg gattgagaaa accctggcgg aaggacggga acggggctat 
gtggaaaccc tctttggccg ccggcgctac gtgcccgact tggcttcccg ggtgaagagc 
atccgggagg cagcggagcg catggccttc aacatgccgg tccaggggac cgccgcggat 
ttgatgaaac tggccatggt gaagctcttt cccaggcttc aggagctggg ggccaggatg 
cttttgcagg tgcacaacga actggtcctc gaggctccca aggagcaagc ggaggaagtc 
gcccaggagg ccaagcggac catggaggag gtgtggcccc tgaaggtgcc cttggaggtg 
gaagtgggca tcggggagga ctggctttcc gccaaggcc 



<210> 444 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 444 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2499 



861 



85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val Tyr Arg Ala 
325 330 335 

Glu Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu 
355 360 365 

Ala Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 



862 



385 



390 395 400 



Glu Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala 
405 410 415 

Leu Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val 
450 455 460 

Glu Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val 
595 600 605 

Val Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Ser Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val 



863 



690 695 VOO 

Arg Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser 
725 730 735 

Arg Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val 
785 790 795 800 

Ala Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Ala 



<210> 445 

<211> 2496 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 445 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



60 
12 0 
180 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 24 0 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 



360 
420 
480 



gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 



600 



864 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcaac aagcccggtt 
gaggccctgg ccgacctcaa ggaggcccgg gggttcctgg ccaaggacct ggccgttttg 
gccctgcggg agggggtggc cctggacccc acggacgacc ccctcctggt ggcctacctc 
ctggacccgg ccaacaccca ccccgagggg gtggcccggc gctacggggg cgagttcacg 
gaggacgcag cggagagggc cctcctctcc gagaggctct tccagaacct ctttaaacgg 
ctttccgaga agctcctctg gctctaccag gaggtggagc ggcccctctc ccgggtcttg 
gcccacatgg aggcccgggg ggtgaggctg gacgtccccc ttctggaggc cctctccttt 
gagctggaga aggagatgga gcgcctggag ggggaggtct tccgtttggc cggccacccc 
ttcaacctca actcccgcga ccagctggaa agggtcctct ttgacgagct gggcctcacc 1500 
ccggtgaagc ggacgaagaa gacgggcaag cgctccaccg cccagggggc cctggaggcc 1560 
ctccgggggg cccaccccat cgtggagctc atcctccagt accgggagct ttccaagctc 162 0 
aaaagcacct acctggaccc cctgccccgg ctcgtccacc cgcggacggg ccggctccac 
acccgcttca accagacggc cacggccacg ggaaggcttt ccagctccga ccccaacctg 
cagaacatcc ccgtgcgcac ccccttgggg cagcgcatcc gcaaggcctt cgtggccgag 
gaggggtggc tccttttggc ggcggactac tcccagattg agctccgggt cctggcccac 
ctctcggggg acgagaacct gaagcgggtc ttccgggagg ggaaggacat ccataccgag 
accgccgcct ggatgttcgg cttagacccc gctctggtgg atccaaagat gcgccgggcg 1980 
gccaagacgg tcaacttcgg cgtcctctac gggatgtccg cccacaggct ctcccaggag 2040 
ctcggcatag actacaagga ggcggaggcc tttattgagc gctacttcca gagcttcccc 2100 
aaggtgcggg cctggataga aaggaccctg gaggagggcc ggacgcgggg ctacgtggag 2160 
accctgttcg gcaggaggcg ctatgtgccc gacctggcct cccgggtccg ctcggtgcgg 222 0 
gaggcggcgg agcggatggc cttcaacatg cccgtgcagg gcaccgccgc cgacctgatg 2280 



1680 
1740 
1800 
1860 
1920 



865 



2496 



aagatcgcca tggtcaagct cttccccagg ctaaagcccc tgggggccca cctcctcctc 2340 

caagtggcca acgagctggt cctggaggtg cccgaggacc gggccgagga ggccaaggcc 2400 

ctggtcaagg aggtcatgga gaacgcctac cccctggacg tgcccctcga ggtggaggtg 2460 
ggcgtgggtc gggactggct ggaggcgaag caggat 



<210> 446 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 446 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 



866 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala Arg Gly Phe 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Val Ala Leu 
355 360 365 

Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu Asp Pro Ala 
370 375 380 

Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Phe Thr 
385 390 395 400 

Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Gin Asn 
405 410 415 

Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr Gin Glu Val 
420 425 430 

Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Arg Gly Val 
435 440 445 

Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu Leu Glu Lys 
450 455 460 

Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly Lys Arg Ser 
500 505 510 



867 



Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His Pro lie Val 
515 520 525 



Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Ser Thr Tyr 
530 535 540 

Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu Leu Ala Ala 
595 600 605 

Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He His Thr Glu 
625 630 635 640 

Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val Asp Pro Lys 
645 650 655 

Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Net 
660 665 670 

Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr Lys Glu Ala 
675 680 685 



Glu Ala Phe He Glu Arg Tyr Phe 
690 695 

Trp He Glu Arg Thr Leu Glu Glu 
705 710 



Gin Ser Phe Pro Lys Val Arg Ala 
700 

Gly Arg Thr Arg Gly Tyr Val Glu 
715 720 



Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg Val 
725 730 735 

Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val Lys Leu Phe 
755 760 765 

Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin Val Ala Asn 
770 775 780 

Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu Ala Lys Ala 
785 790 795 800 

Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp Val Pro Leu 
805 810 815 



868 



Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala Lys Gin Asp 
820 825 830 

<210> 447 
<211> 2508 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 447 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



60 
120 
180 
240 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 60 0 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 84 0 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgttta ccgggcggag 102 0 

gatcccttgg aggccttgcg ggggcttggg gaggtgaggg ggcttttggc caaggacctg 1080 

gcggtgctgg ccctgaggga agggattgcc ctggcaccgg gcgacgaccc catgctcctc 1140 

gcctacctcc tggatccttc caacaccgcc cccgaagggg tagcccggcg ctacgggggg 1200 

gagtggaccg aggaggcggg ggaaagggcg ctgctttccg aaaggcttta cgccgccctc 12 60 

ctgaagcggc ttaaggggga ggagaggctt ctttggcttt acgaggaggt ggaaaagccc 1320 



900 
960 



869 



ctttcgcggg tcctggccca catggaggcc acgggggtac ggttggatgt ggcctactta 1380 
aaggcccttt ccctggaggt ggaggcggag ataaggcgct tcgaggagga ggtccaccgc 144 0 
ctggccgggc atcctttcaa cctgaactcc cgggaccagc tggaaagggt catctttgac 
gagcttgggc ttcccgccat caagaagacg aggaagacgg gcaagcgctc caccagcgcc 
gccgttttgg aggccttgcg ggaggctcat cccatcgtgg accgcatcct tcagtaccgg 
gagctttcca agctcaaggg aacctacatc gatcccttgc ctgccctggt ccaccccaag 
acgaaccgcc tccacacccg tttcaaccag acggccaccg ccacggggag gcttagcagc 
tcggatccta atctgcaaaa tatccccgtg cgcacccctt tgggccagcg gatccgccgg 
gccttcgtgg ccgaggaggg gtggaggctg gtggttttgg actacagcca gattgagctc 
agggtcctgg cgcacctttc cggggacgag aacctaatcc gggtcttcca ggagggccag 
gacatccaca cccagacggc cagctggatg ttcggcgtgc ccccagaggc cgtggattcc 
ctgatgcgcc gggcggccaa gaccatcaac ttcggcgtcc tctacggcat gtccgcccac 
cggctttcgg gagagctggc catcccctac gaggaggcgg tggccttcat cgagcggtat 
ttccagagct accccaaggt gcgggcctgg attgagaaaa ccctggcgga aggacgggaa 
cggggctatg tggaaaccct ctttggccgc cggcgctacg tgcccgactt ggcttcccgg 
gtgaagagca tccgggaggc agcggagcgc atggccttca acatgccggt ccaggggacc 
gccgcggatt tgatgaaact ggccatggtg aagctctttc ccaggcttca ggagctgggg 
gccaggatgc ttttgcaggt gcacaacgaa ctggtcctcg aggctcccaa ggagcaagcg 
gaggaagtcg cccaggaggc caagcggacc atggaggagg tgtggcccct gaaggtgccc 
ttggaggtgg aagtgggcat cggggaggac tggctttccg ccaaggcc 



<210> 448 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 448 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2508 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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Tyr Arg Ala Glu Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val 
340 345 350 



Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

lie Ala Leu Ala Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Tyr Ala Ala Leu Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Glu Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser 
450 455 460 

Leu Glu Val Glu Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val He Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys 
530 535 540 

Leu Lys Gly Thr Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys 
545 550 555 560 

Thr Asn Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp 
595 600 605 

Arg Leu Val Val Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin 
625 630 635 640 
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Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Ser Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Ala Ser Arg Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala 
785 790 795 800 

Glu Glu Val Ala Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro 
805 810 815 

Leu Lys Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Ala 
835 

<210> 449 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 449 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 



60 
120 
180 
240 
300 
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ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


Lcgagguccc 


yy 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 




420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 




cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agt-acgy cct 


«-y y ^ y y °-y 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg 


acaacc t ccc 


cgyyy i-<-ciciy 


600 


ggcatcgggg 


agaagaccgc 


cc ucaagc- 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagc uctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


/-V /"< f^r^ 

gcccy ac^cyy 


gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgaguucggc 


ctcctggagg 


cccccgcccc 


cctQQaocracr 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtcca 


v^t_.yyy^ca.ci\-'Ci. 


1 n o n 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caay y ac(_ L.y 


1 n p n 

± U O u 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc 




114 0 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


X ^ z u 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1 ft n 

X ^ O VJ 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg gggaggtctt 


ccgtttggcc 


1 44 0 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


X -J V w 


ggcctcaccc 


cggtgaagcg 


gacgaagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


J, J o u 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


T ^ O A 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


T /T Q n 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg 


caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg gcggactact 


cccagattga 


gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 
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cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag gcggaggcct 


ttattgagcg 


ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg 


gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc 


ccgggtccgc 


2220 


tcggtgcggg 


aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg 


caccgccgcc 


2280 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct 


gggggcccac 


2340 


ctcctcctcc 


aagtggccaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg 


ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt 


gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 




2505 



<210> 450 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 450 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
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145 



150 



155 



160 



Pro Glu Gly His Leu 
165 

Leu Arg Pro Glu Gin 
180 

Ser Asp Asn Leu Pro 
195 

Lys Leu Leu Lys Glu 
210 

Asp Arg Val Lys Pro 
225 

Glu Asp Leu Arg Leu 
245 

Pro Leu Glu Val Asp 
260 

Leu Arg Ala Phe Leu 
275 

Phe Gly Leu Leu Glu 
290 

Pro Pro Glu Gly Ala 
305 

Met Trp Ala Glu Leu 
325 

His Arg Ala Thr Ser 
340 

Arg Gly Phe Leu Ala 
355 

Val Ala Leu Asp Pro 
370 

Asp Pro Ala Asn Thr 
385 

Glu Phe Thr Glu Asp 
405 

Phe Gin Asn Leu Phe 
420 

Gin Glu Val Glu Arg 
435 

Arg Gly Val Arg Leu 



lie Thr Pro Glu Trp Leu 
170 

Trp Val Asp Phe Arg Ala 
185 

Gly Val Lys Gly He Gly 
200 

Trp Gly Ser Leu Glu Asn 
215 

Glu Asn Val Arg Glu Lys 
230 235 

Ser Leu Glu Leu Ser Arg 
250 

Leu Ala Gin Gly Arg Glu 
265 

Glu Arg Leu Glu Phe Gly 
280 

Ala Pro Ala Pro Leu Glu 
295 

Phe Val Gly Phe Val Leu 
310 315 

Lys Ala Leu Ala Ala Cys 
330 

Pro Val Glu Ala Leu Ala 
345 

Lys Asp Leu Ala Val Leu 
360 

Thr Asp Asp Pro Leu Leu 
375 

His Pro Glu Gly Val Ala 
390 395 

Ala Ala Glu Arg Ala Leu 
410 

Lys Arg Leu Ser Glu Lys 
425 

Pro Leu Ser Arg Val Leu 
440 

Asp Val Pro Leu Leu Glu 



Trp Glu Lys Tyr Gly 
175 

Leu Val Gly Asp Pro 
190 

Glu Lys Thr Ala Leu 
205 

Leu Leu Lys Asn Leu 
220 

He Lys Ala His Leu 
240 

Val Arg Thr Asp Leu 
255 

Pro Asp Arg Glu Gly 
270 

Ser Leu Leu His Glu 
285 

Glu Ala Pro Trp Pro 
300 

Ser Arg Pro Glu Pro 
320 

Arg Gly Gly Arg Val 
335 

Asp Leu Lys Glu Ala 
350 

Ala Leu Arg Glu Gly 
365 

Val Ala Tyr Leu Leu 
380 

Arg Arg Tyr Gly Gly 
400 

Leu Ser Glu Arg Leu 
415 

Leu Leu Trp Leu Tyr 
430 

Ala His Met Glu Ala 
445 

Ala Leu Ser Phe Glu 



876 



450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
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755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 451 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 451 

cttctctcat ccgccaaaac agcc 



<210> 452 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 452 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



24 



60 
12 0 
180 
240 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 3 60 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42 0 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 



480 



cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 



cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 



600 



878 




879 



gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 



<210> 453 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 453 

ctcctccacg agttcggc 18 



<210> 454 
<211> 2514 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 454 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 3 00 

atggtggacc ttttgggctt tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 3 60 

ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 42 0 

gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 480 

ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 54 0 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 72 0 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 
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cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 84 0 

ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 900 

ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 

cttaacgcct tggccgccgc ctggggcggc cgcgtgcacc gggcagcaga ccccttggcg 1020 

gggctaaagg acctcaagga ggtccggggc ctcctcgcca aggacctcgc cgtcttggcc 108 0 

tcgagggagg ggctagacct cgtgcccggg gacgacccca tgctcctcgc ctacctcctg 114 0 

gacccttcga acaccacccc cgagggggtg gcgcggcgct acggggggga gtggacggag 12 00 

gacgccgccc accgggccct cctctcggag aggctccatc ggaacctcct taagcgcctc 1260 

g^ggggg^gg agaagctcct ttggctctac cacgaggtgg aaaagcccct ctcccgggtc 132 0 

ctggcccata tggaggccac cggggtacgg cgggacgtgg cctaccttca ggccctttcc 13 8 0 

ctggagcttg cggaggagat ccgccgcctc gaggaggagg tcttccgctt ggcgggccac 144 0 

cccttcaacc tcaactcccg ggaccagctg gaaagggtgc tctttgacga gcttaggctt 1500 

cccgccttga agaagacgaa gaagacaggc aagcgctcca ccagcgccgc ggtgctggag 1560 

gccctacggg aggcccaccc catcgtggag aagatcctcc agcaccggga gctcaccaag 162 0 

ctcaagaaca cctacgtgga ccccctccca agcctcgtcc acccgaggac gggccgcctc 1680 

cacacccgct tcaaccagac ggccacggcc acggggaggc ttagtagctc cgaccccaac 174 0 

ctgcagaaca tccccgtccg cacccccttg ggccagagga tccgccgggc cttcgtggcc 1800 

gaggcgggtt gggcgttggt ggccctggac tatagccaga tagagctccg cgtcctcgcc 1860 

cacctctccg gggacgaaaa cctgatcagg gtcttccagg aggggaagga catccacacc 1920 

cagaccgcaa gctggatgtt cggcgtcccc ccggaggccg tggaccccct gatgcgccgg 1980 

gcggccaaga cggtgaactt cggcgtcctc tacggcatgt ccgcccatag gctctcccag 2040 

gagcttgcca tcccctacga ggaggcggtg gcctttatag agcgctactt ccaaagcttc 210 0 

cccaaggtgc gggcctggat agaaaagacc ctggaggagg ggaggaagcg gggctacgtg 2160 

gaaaccctct tcggaagaag gcgctacgtg cccgacctca acgcccgggt gaagagcgtc 2220 

^ggg^-ggccg cggagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 22 8 0 

atgaagctcg ccatggtgaa gctcttcccc cgcctccggg agatgggggc ccgcatgctc 2340 

ctccaggtcg ccaacgagct cctcctggag gccccccaag cgcgggccga ggaggtggcg 2400 

gctttggcca aggaggccat ggagaaggcc tatcccctcg ccgtgcccct ggaggtggag 2460 
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gtggggatgg 


gggaggactg 


gctttccgcc 


aagggtcacc 


accaccacca 


ccac 


2514 


<210> 455 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 455 
atgaattccg 


^ggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcatcgt 


ggtctttgac 


gccgaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


g^sggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


C9^99tgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtatgggct 


taagccttcc 


540 


cagtgggtgg 


actaccgggc 


cttggccggg 


gacccttccg 


acaacatccc 


cggcgtgaag 


600 


ggcatcgggg 


agaagacggc 


ggccaagctg 


atccgggagt 


ggggaagcct 


ggaaaacctt 


660 


cttaagcacc 


tggaacaggt 


gaaacctgcc 


tccgtgcggg 


agaagatcct 


tagccacatg 


720 


gaggacctca 


agctatccct 


ggagctatcc 


cqqqtqcaca 


cggacttgct 


ccttcaggtg 


780 


gacttcgccc 


ggcgccggga 


qccqqaccqq 


qaqqqqctta 


aggccttttt 


qqaqaqqctq 


840 


gagttcggaa 


gcctcctcca 


cqaqttcqqc 


ctgttggaaa 


qcccqqtqqc 


qqcqqaqqaa 


900 


gctccctggc 


cgccccccga 


gggagccttc 


gtggggtacg 


ttctttcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaacgc 


cttggccgcc 


gcctggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


qaqqtccqqq 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


oqcrcf c t aqa c 


ctcgtgcccg 




ca tgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 



882 



caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 
ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 
gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 
gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 
gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 
acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 
tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 
gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 



caccac 



<210> 456 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 456 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



2340 
2400 
2460 
2520 
2526 



883 



Lys Gly Leu Thr Thr Ser Arg Giy Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro 
180 185 190 

Ser Asp Asn lie Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala 
195 200 205 

Lys Leu lie Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 



884 



His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 



885 



Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 



Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 457 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 457 

ttccaggtgc ttgaggaggt tttccag 



<210> 458 

<211> 27 

<212> DNA 

<213> Artificial 



886 



<220> 

<223> Synthetic 
<400> 458 

ctcctcaagc acctggaaca ggtgaaa 



<210> 459 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 459 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



27 



60 
120 
180 



420 
480 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 

gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 60 0 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagcacc tggaacaggt gaaacctgcc tccgtgcggg agaagatcct tagccacatg 72 0 

gaggacctca agctatccct ggagctatcc cgggtgcaca cggacttgct ccttcaggtg 780 

gacttcgccc ggcgccggga gccggaccgg gaggggctta aggccttttt ggagaggctg 840 
gagttcggaa gcctcctcca cgagttcggc ctgttggaaa gcccggtggc ggcggaggaa 
gctccctggc cgccccccga gggagccttc gtggggtacg ttctttcccg ccccgagccc 

atgtgggcgg agcttaacgc cttggccgcc gcctggggcg gccgcgtgca ccgggcagca 102 0 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 



900 
960 



887 



cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 132 0 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 13 80 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 15 00 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 198 0 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2 04 0 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccac 2526 

<210> 460 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 460 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 



888 



1 



5 



10 



15 



Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 



889 



305 



310 



315 



320 



Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 



890 



610 



615 



620 



His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


461 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


461 



gtccaggttc ttgaggaggt tttccag 



<210> 462 



891 



<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 462 

ctcctcaaga acctggaccg ggtaaag 



27 



<210> 463 

<211> 2532 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 463 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 12 0 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 



480 



cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccggac tcgctccggc gcaagataga ggcgcacctc 72 0 

gaggacctcc acctctcctt agacctggcc cgcatccgca ccgacctccc cctggaggtg 780 
gactttaagg ccctgcgccg caggaccccc gacctggagg gcctgagggc ctttttggag 
gagctggagt tcggaagcct cctccacgag ttcggcctcc tgggagggga gaagccccgg 
gaggaggccc cctggccccc gcccgaaggg gccttcgtgg gcttcctcct ttcccgcaag 

gagcccatgt gggcggagct tctggccctg gcggcggcct cgggcggccg cgtgcaccgg 1020 

gcagcagacc ccttggcggg gctaaaggac ctcaaggagg tccggggcct cctcgccaag 1080 

gacctcgccg tcttggcctc gagggagggg ctagacctcg tgcccgggga cgaccccatg 114 0 



840 
900 
960 



892 



ctcctcgcct 


acctcctgga 


cccttcgaac 


accacccccg 


agggggtggc 


gcggcgctac 


1200 


gggggggagt 


ggacggagga 


cgccgcccac 


cgggccctcc 


tctcggagag 


gctccatcgg 


1260 


aacctcctta 


agcgcctcga 


gggggaggag 


aagctccttt 


ggctctacca 


cgaggtggaa 


1320 


aagcccctct 


cccgggtcct 


ggcccatatg 


gaggccaccg 


gggtacggcg ggacgtggcc 


1380 


taccttcagg 


ccctttccct ggagcttgcg gaggagatcc gccgcctcga ggaggaggtc 


1440 


ttccgcttgg 


cgggccaccc 


rt" t* caacctc 

\^ \^ ^ \^ t-*. V* ^ ^ ^ 


aactcccggg 


accagctgga 


aagggtgctc 


1500 


tttgacgagc 


ttaggcttcc 


a p c t" t era aci 


aagacgaaga 


agacaggcaa 


gcgctccacc 


1560 


agcgccgcgg 


tgctggaggc 


^ v_, 3 Z3 J 


gcccacccca 


tcgtggagaa 


gatcctccag 


1620 


caccgggagc 


tcaccaagct 




taccrtctoacc 


ccctcccaag 


cctcgtccac 


1680 


ccgaggacgg gccgcctcca 




aaccacracQcr 


ccacggccac 


ggggaggctt 


1740 


agtagctccg 


accccaacct 


gcagaacatc 


cccgtccgca 


cccccttggg 


ccagaggatc 


1800 


cgccgggcct 


tcgtggccga 


ggcgggttgg 


gcgttggtgg 


ccctggacta 


tagccagata 


1860 


gagctccgcg 


tcctcgccca 


cctctccggg 


gacgaaaacc 


tgatcagggt 


cttccaggag 


1920 


gggaaggaca 


tccacaccca 


gaccgcaagc 


tggatgttcg 


gcgtcccc cc 


ggaggccgtg 


1980 


gaccccctga 


tgcgccgggc 


ggccaagacg 


gtgaacttcg 


rrr'O'hr'PiT'ta 


cggcatgtcc 


2 04 0 


gcccataggc 


tctcccagga 


gcttgccatc 


ccctacgagg 


aggcggtggc 


ctttatagag 


2100 


cgctacttcc 


aaagcttccc 


caaggtgcgg 


gcctggatag 


aaaagaccct 


ggaggagggg 


2160 


aggaagcggg 


gctacgtgga 


aaccctcttc 


ggaagaaggc 


gctacgtgcc 


cgacctcaac 


2220 


gcccgggtga agagcgtcag ggaggccgcg gagcgcatgg 


ccttcaacat 


gcccgtccag 


2280 


ggcaccgccg 


ccgacctcat 


gaagctcgcc 


atggtgaagc 


tcttcccccg 


cctccgggag 




atgggggccc 


gcatgctcct 


ccaggtcgcc 


aacgagctcc 


tcctggaggc 


cccccaagcg 


2400 


cgggccgagg 


aggtggcggc 


tttggccaag 


gaggccatgg 


agaaggccta 


tcccctcgcc 


2460 


gtgcccctgg 


aggtggaggt 


ggggatgggg 


gaggactggc 


tttccgccaa 


gggtcaccac 


2520 


caccaccacc 


ac 










2532 



<210> 464 

<211> 844 

<212> PRT 

<213> Artificial 

<220> 

<22 3> Synthetic 
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<400> 464 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu 
225 230 235 240 

Glu Asp Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu 
260 265 270 

Glu Gly Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu 
275 280 285 
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His Glu Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro 
290 295 300 



Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys 
305 310 315 320 

Glu Pro Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly 
325 330 335 

Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys 
340 345 350 

Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg 
355 360 365 

Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr 
370 375 380 

Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr 
385 390 395 400 

Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu 
405 410 415 

Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu 
420 425 430 

Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala 
435 440 445 

His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala 
450 455 460 

Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val 
465 470 475 480 

Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu 
485 490 495 

Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr 
500 505 510 

Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu 
515 520 525 

Arg Glu Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu 
530 535 540 

Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His 
545 550 555 560 

Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala 
565 570 575 

Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val 
580 585 590 
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Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala 
595 600 605 

Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val 
610 615 620 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu 
625 630 635 640 

Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro 
645 650 655 

Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn 
660 665 670 

Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu 
675 680 685 

Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin 
690 695 700 

Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly 
705 710 715 720 

Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val 
725 730 735 

Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
740 745 750 

Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys 
755 760 765 

Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg 
770 775 780 

Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala 
785 790 795 800 

Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala 
805 810 815 

Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp 
820 825 830 

Trp Leu Ser Ala Lys Gly His His His His His His 
835 840 



<210> 465 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 465 
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gacgtccttc ggggtgatga ggtggcc 



27 



<210> 466 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 466 

atcaccccga aggacgtcca ggagaag 2 7 

<210> 467 

<211> 2532 

<212> mA 

<213> Artificial 

<220> 

Jf <223> Synthetic 

<400> 467 

^ atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

hi ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

7 gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

IrJ; gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

fy tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

Q ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 4 80 

cccgagggcc acctcatcac cccgaaggac gtccaggaga agtacggggt gcccccggag 54 0 

cgctgggtgg acttccgcgc cctcacgggg gaccgctcgg acaacatccc cggggtggcg 600 

gggatagggg agaagaccgc ccttcgactc ctcgcagagt gggggagcgt ggaaaacctc 660 

ctgaagaacc tggaccgggt aaagccggac tcgctccggc gcaagataga ggcgcacctc 72 0 

gaggacctcc acctctcctt agacctggcc cgcatccgca ccgacctccc cctggaggtg 780 

gactttaagg ccctgcgccg caggaccccc gacctggagg gcctgagggc ctttttggag 84 0 

gagctggagt tcggaagcct cctccacgag ttcggcctcc tgggagggga gaagccccgg 900 

gaggaggccc cctggccccc gcccgaaggg gccttcgtgg gcttcctcct ttcccgcaag 960 

gagcccatgt gggcggagct tctggccctg gcggcggcct cgggcggccg cgtgcaccgg 1020 
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gcagcagacc ccttggcggg gctaaaggac ctcaaggagg tccggggcct cctcgccaag 1080 

gacctcgccg tcttggcctc gagggagggg ctagacctcg tgcccgggga cgaccccatg 1140 

ctcctcgcct acctcctgga cccttcgaac accacccccg agggggtggc gcggcgctac 1200 

gggggggagt ggacggagga cgccgcccac cgggccctcc tctcggagag gctccatcgg 126 0 

aacctcctta agcgcctcga gggggaggag aagctccttt ggctctacca cgaggtggaa 1320 

aagcccctct cccgggtcct ggcccatatg gaggccaccg gggtacggcg ggacgtggcc 13 8 0 

taccttcagg ccctttccct ggagcttgcg gaggagatcc gccgcctcga ggaggaggtc 1440 

ttccgcttgg cgggccaccc cttcaacctc aactcccggg accagctgga aagggtgctc 1500 

tttgacgagc ttaggcttcc cgccttgaag aagacgaaga agacaggcaa gcgctccacc 1560 

agcgccgcgg tgctggaggc cctacgggag gcccacccca tcgtggagaa gatcctccag 162 0 

caccgggagc tcaccaagct caagaacacc tacgtggacc ccctcccaag cctcgtccac 1680 

ccgaggacgg gccgcctcca cacccgcttc aaccagacgg ccacggccac ggggaggctt 174 0 

agtagctccg accccaacct gcagaacatc cccgtccgca cccccttggg ccagaggatc 1800 

cgccgggcct tcgtggccga ggcgggttgg gcgttggtgg ccctggacta tagccagata 1860 

gagctccgcg tcctcgccca cctctccggg gacgaaaacc tgatcagggt cttccaggag 192 0 

gggaaggaca tccacaccca gaccgcaagc tggatgttcg gcgtcccccc ggaggccgtg 1980 

gaccccctga tgcgccgggc ggccaagacg gtgaacttcg gcgtcctcta cggcatgtcc 204 0 

gcccataggc tctcccagga gcttgccatc ccctacgagg aggcggtggc ctttatagag 2100 

cgctacttcc aaagcttccc caaggtgcgg gcctggatag aaaagaccct ggaggagggg 2160 

aggaagcggg gctacgtgga aaccctcttc ggaagaaggc gctacgtgcc cgacctcaac 222 0 

gcccgggtga agagcgtcag ggaggccgcg gagcgcatgg ccttcaacat gcccgtccag 2280 

ggcaccgccg ccgacctcat gaagctcgcc atggtgaagc tcttcccccg cctccgggag 234 0 

a-tgggggccc gcatgctcct ccaggtcgcc aacgagctcc tcctggaggc cccccaagcg 2400 

cgggccgagg aggtggcggc tttggccaag gaggccatgg agaaggccta tcccctcgcc 2460 

gtgcccctgg aggtggaggt ggggatgggg gaggactggc tttccgccaa gggtcaccac 252 0 

caccaccacc ac 2532 

<210> 468 
<211> 844 
<212> PRT 
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<213> Artificial 



<220> 

<223> Synthetic 
<400> 468 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly 
165 170 175 

Val Pro Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg 
180 185 190 

Ser Asp Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Arg Leu Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu 
225 230 235 240 

Glu Asp Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu 
260 265 270 
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Glu Gly Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu 
275 280 285 



His Glu Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro 
290 295 300 

Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys 
305 310 315 320 

Glu Pro Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly 
325 330 335 

Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys 
340 345 350 

Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg 
355 360 365 

Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr 
370 375 380 

Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr 
385 390 395 400 

Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu 
405 410 415 

Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu 
420 425 430 

Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala 
435 440 445 

His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala 
450 455 460 

Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val 
465 470 475 480 

Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu 
485 490 495 

Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr 
500 505 510 

Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu 
515 520 525 

Arg Glu Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu 
530 535 540 

Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His 
545 550 555 560 

Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala 
565 570 575 



900 



Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val 
580 585 590 

Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala 
595 600 605 

Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val 
610 615 620 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu 
625 630 635 640 

Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro 
645 650 655 

Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn 
660 665 670 

Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu 
675 680 685 

Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin 
690 695 700 

Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly 
705 710 715 720 

Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val 
725 730 735 

Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
740 745 750 

Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys 
755 760 765 

Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg 
770 775 780 

Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala 
785 790 795 800 

Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala 
805 810 815 

Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp 
820 825 830 

Trp Leu Ser Ala Lys Gly His His His His His His 
835 840 

<210> 469 

<211> 27 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 469 

ctcgaggcgg gtaaacccca ggaggtc 



<210> 470 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 470 

gggtttaccc gcctcgaggt gcccggc 



<210> 471 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 471 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



27 



27 



60 
120 
180 
240 
300 
360 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtgcc gggctttgaa 
gcggatgacg tcctggctac cctggccaag aaggcggaaa aggaaggcta cgaagtgcgc 420 
atcctcaccg cggaccggga cctttaccag cttctttcgg agcgaatctc catccttcac 
ccggagggtt acctgatcac cccggagtgg ctttgggaga agtatgggct taagccttcc 
cagtgggtgg actaccgggc cttggccggg gacccttccg acaacatccc cggcgtgaag 
ggcatcgggg agaagacggc ggccaagctg atccgggagt ggggaagcct ggaaaacctt 660 
cttaagcacc tggaacaggt gaaacctgcc tccgtgcggg agaagatcct tagccacatg 720 
gaggacctca agctatccct ggagctatcc cgggtgcaca cggacttgct ccttcaggtg 780 
gacttcgccc ggcgccggga gccggaccgg gaggggctta aggccttttt ggagaggctg 84 0 
gagttcggaa gcctcctcca cgagttcggc ctgttggaaa gcccggtggc ggcggaggaa 900 



480 
540 
600 
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gctccctggc 


cgccccccga gggagccttc 


gtggggtacg 


ttctttcccg 


ccccgagccc 




atgtgggcgg 


agcttaacgc 




gcctggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1 A o n 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1 14 U 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


T o A n 
1^ U U 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


Izo U 


cttaagcgcc 


tcgaggggga ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


132 0 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


li bU 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1 /I /I A 

144 U 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1 c; A A 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


T C A 

iDO U 


gcggtgctgg 


aggccctacg ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


162 0 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


T O /T A 
lo D U 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 




T Q O A 

ly Z U 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 
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<210> 472 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 472 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His 
145 150 155 160 

Pro Glu Gly Tyr Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro 
180 185 190 

Ser Asp Asn He Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala 
195 200 205 

Lys Leu He Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 



904 



245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
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545 



550 



555 



560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 473 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 473 

aagccactcc ggggtgatca ggtaacc 



<210> 474 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 474 

atcaccccgg agtggctttg ggagaag 



<210> 475 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 475 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 



27 



27 



ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 
gtgatcgtgg tctttgacgc cgaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 
ctgatcaccc cggagtggct ttgggagaag tacggcctca ggccggagca gtgggtggac 
ttccgcgccc tcgtggggga cccctccgac aacctccccg gggtcaaggg catcggggag 
aagaccgccc tcaagctcct caaggagtgg ggaagcctgg aaaacctcct caagaacctg 
gaccgggtaa agccagaaaa cgtccgggag aagatcaagg cccacctgga agacctcagg 720 
ctctccttgg agctctcccg ggtgcgcacc gacctccccc tggaggtgga cctcgcccag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



780 
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999cgggagc ccgaccggga ggggcttagg gccttcctgg agaggctgga gttcggcagc 84 0 

ctcctccacg agttcggcct cctggaggcc cccgcccccc tggaggaggc cccctggccc 900 

ccgccggaag gggccttcgt gggcttcgtc ctctcccgcc ccgagcccat gtgggcggag 960 

cttaaagccc tggccgcctg caggggcggc cgcgtgcacc gggcagcaga ccccttggcg 1020 

gggctaaagg acctcaagga ggtccggggc ctcctcgcca aggacctcgc cgtcttggcc 10 8 0 

tcgagggagg ggctagacct cgtgcccggg gacgacccca tgctcctcgc ctacctcctg 1140 

gacccttcga acaccacccc cgagggggtg gcgcggcgct acggggggga gtggacggag 120 0 

gacgccgccc accgggccct cctctcggag aggctccatc ggaacctcct taagcgcctc 1260 

g^ggggg^gg agaagctcct ttggctctac cacgaggtgg aaaagcccct ctcccgggtc 132 0 

ctggcccata tggaggccac cggggtacgg cgggacgtgg cctaccttca ggccctttcc 13 8 0 

ctggagcttg cggaggagat ccgccgcctc gaggaggagg tcttccgctt ggcgggccac 1440 

cccttcaacc tcaactcccg ggaccagctg gaaagggtgc tctttgacga gcttaggctt 1500 

cccgccttga agaagacgaa gaagacaggc aagcgctcca ccagcgccgc ggtgctggag 1560 

gccctacggg aggcccaccc catcgtggag aagatcctcc agcaccggga gctcaccaag 1620 

ctcaagaaca cctacgtgga ccccctccca agcctcgtcc acccgaggac gggccgcctc 1680 

cacacccgct tcaaccagac ggccacggcc acggggaggc ttagtagctc cgaccccaac 1740 

ctgcagaaca tccccgtccg cacccccttg ggccagagga tccgccgggc cttcgtggcc 1800 

gaggcgggtt gggcgttggt ggccctggac tatagccaga tagagctccg cgtcctcgcc 1860 

cacctctccg gggacgaaaa cctgatcagg gtcttccagg aggggaagga catccacacc 1920 

cagaccgcaa gctggatgtt cggcgtcccc ccggaggccg tggaccccct gatgcgccgg 1980 

gcggccaaga cggtgaactt cggcgtcctc tacggcatgt ccgcccatag gctctcccag 2 04 0 

gagcttgcca tcccctacga ggaggcggtg gcctttatag agcgctactt ccaaagcttc 2100 

cccaaggtgc gggcctggat agaaaagacc ctggaggagg ggaggaagcg gggctacgtg 2160 

gaaaccctct tcggaagaag gcgctacgtg cccgacctca acgcccgggt gaagagcgtc 2220 

agggaggccg cggagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 2280 

atgaagctcg ccatggtgaa gctcttcccc cgcctccggg agatgggggc ccgcatgctc 2340 

ctccaggtcg ccaacgagct cctcctggag gccccccaag cgcgggccga ggaggtggcg 2400 

gctttggcca aggaggccat ggagaaggcc tatcccctcg ccgtgcccct ggaggtggag 2460 



908 



9tggggatgg gggaggactg gctttccgcc aagggtcacc accaccacca ccac 



2514 



.J". 



<210> 476 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 476 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 



Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
O 100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu Lys Leu Leu Lys 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val Lys 
210 215 220 

Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu Glu Asp Leu Arg 
225 230 235 240 
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Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu Gin Val 
245 250 255 



Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 
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Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 
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<210> 477 

<211> 64 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 477 

caccaccacc accaccacgt cgactagtgc tagcgtcgac tagctgcagg catgcaagct 60 
tggc 64 



<210> 478 

<211> 64 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 478 

gccaagcttg catgcctgca gctagtcgac gctagcacta gtcgacgtgg tggtggtggt 60 
ggtg 64 



<210> 479 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 479 

caggaagcgg ccgcgtcgac atgaccatga ttacgccaag c 41 



<210> 480 

<211> 42 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 480 

gggcccgcca gggtcgactc agggcgatgg cccactacgt ga 42 



<2ia> 481 

<211> 3135 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 481 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccgggtcct 


cctggtggac 


b U 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


J.Z U 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct 


gaaggaggac 


T Q A 

loU 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc cctccttccg 


ccacgaggcc 


24 U 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg acttcccccg gcagctcgcc 


T n A 
J U U 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc tcgaggtccc 


cggctacgag 


J o u 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta 


cgaggtgcgc 


ft Z u 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc 


cgtcctccac 




cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct 


caggccggag 


A n 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg acaacctccc 


cggggtcaag 


£r n A 
o U U 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


c /z r\ 
O D U 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg agaagatcaa 


ggcccacctg 


72 0 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca ccgacctccc 


cctggaggtg 


78 0 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct 


ggagaggctg 


840 


gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag gcaagcgctc 


caccagcgcc 


1560 
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gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 162 0 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 174 0 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 192 0 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 198 0 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 22 80 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 24 00 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 24 6 0 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 252 0 

caccacgtcg acatgaccat gattacgcca agctatttag gtgacactat agaatactca 2580 

agctatgcat caagcttggt accgagctcg gatccactag taacggccgc cagtgtgctg 264 0 

gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag agggcccaat 2700 

tcgccctata gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg 2760 

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg 2820 

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc 2880 

gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 2 940 

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 3000 

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 3060 

cgatttagag ctttacggca cctcgaccgc aaaaaacttg atttgggtga tggttcacgt 3120 

agtgggccat cgccc 3135 

<210> 482 
<211> 1045 
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<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 482 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
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260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His Val Asp Met Thr Met He 
835 840 845 

Thr Pro Ser Tyr Leu Gly Asp Thr He Glu Tyr Ser Ser Tyr Ala Ser 
850 855 860 

Ser Leu Val Pro Ser Ser Asp Pro Leu Val Thr Ala Ala Ser Val Leu 
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865 870 



875 880 



Glu Phe Cys Arg Tyr Pro Ser His Trp Arg Pro Leu Glu His Ala Ser 
885 890 895 

Arg Gly Pro Asn Ser Pro Tyr Ser Glu Ser Tyr Tyr Asn Ser Leu Ala 
900 905 910 

Val Val Leu Gin Arg Arg Asp Trp Glu Asn Pro Gly Val Thr Gin Leu 
915 920 925 

Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Trp Arg Asn Ser Glu 
930 935 940 

Glu Ala Arg Thr Asp Arg Pro Ser Gin Gin Leu Arg Ser Leu Asn Gly 
945 950 955 960 

Glu Trp Asp Ala Pro Cys Ser Gly Ala Leu Ser Ala Ala Gly Val Val 
965 970 975 

Val Thr Arg Ser Val Thr Ala Thr Leu Ala Ser Ala Leu Ala Pro Ala 
980 985 990 

Pro Phe Ala Phe Phe Pro Ser Phe Leu Ala Thr Phe Ala Gly Phe Pro 
995 1000 1005 

Arg Gin Ala Leu Asn Arg Gly Leu Pro Leu Gly Phe Arg Phe Arg 
1010 1015 1020 

Ala Leu Arg His Leu Asp Arg Lys Lys Leu Asp Leu Gly Asp Gly 
1025 1030 1035 

Ser Arg Ser Gly Pro Ser Pro 
1040 1045 

<210> 483 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 483 

cgggacctcg aggcgcgtga accccaggag gtccac 



<210> 484 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 484 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
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caccacctgg 


cctaccgcac 


cttccacgcc ctgaagggcc tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag agcctcctca aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg cgcctcgagg tcccgggcta 


cgaggcggac 


J D U 


gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg gctacgaggt 


ccgcatcctc 


42 0 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca tccacgtcct 


ccaccccgag 


4 8 0 


gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc 


cgaccagtgg 


54 0 


gccgactacc 


gggccctgac 


cggggacgag tccgacaacc ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa gcttctggag gagtggggga gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc tggcccacat 


ggacgatctg 


7 9 0 


aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc ccctggaggt 


ggacttcgcc 


7 P! n 


aaaaggcggg 


agcccgaccg 


ggagaggctt agggccuuuc ugyagayyi^u 


tgagtttggc 


8 4 0 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg ccctggagga 


ggccccctgg 


90 0 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtgc accgggcagc 


agaccccttg 


X \J ^ W 


gcggggctaa 


aggacctcaa 


ggaggtccgg ggcctcctcg ccaaggacct 


CQCCQtCttq 


T n Q Pi 
± U o U 


gcctcgaggg 


aggggctaga 


cctcgtgccc ggggacgacc ccatgctcct 


cgcctacct c 




ctggaccctt 


cgaacaccac 


ccccgagggg gtggcgcggc gctacggggg 


aaaataaaccr 


1 o n ri 


gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc atcggaacct 


ccttaagcgc 




ctcgaggggg 


aggagaagct 


cctttggctc taccacgagg tggaaaagcc 


cctctcccgg 


132 0 


gtcctggccc 


atatggaggc 


caccggggta cggcgggacg tggcctacct 


tcaggccctt 


13 80 


tccctggagc 


ttgcggagga 


gatccgccgc ctcgaggagg aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg tgctctttga 


cgagcttagg 


15 00 


cttcccgcct 


tgaagaagac 


gaagaagaca ggcaagcgct ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc ccaagcctcg tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggga ggcttagtag 


ctccgacccc 


1740 
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aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 192 0 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2 04 0 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 210 0 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 240 0 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 485 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 485 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 IVO 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 
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Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 



<210> 


486 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


486 



gtggaccttc tgggctttac ccgcctcgag gccccg 



<210> 


487 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


487 



cggggcctcg aggcgggtaa agcccagaag gtccac 



<210> 488 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 488 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
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290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
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595 



600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


489 


<211> 


24 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


489 



tttacccgcc tcgaggtgcc gggc 
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<210> 490 

<211> 36 

<212> DNA 

<213> Artificial 



<220> 

<22 3> Synthetic 



<400> 490 

cggcacctcg aggcgggtaa agcccaaaag gtccac 



<210> 491 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 491 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 
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Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 



Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 
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Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 



Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 
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Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 



Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 

<210> 492 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 492 

atcgtggtct ttgacgccga ggccccctcc ttcc 34 



<210> 493 

<211> 34 

<212> DNA 

<213> Artificial 



<220> 

<22 3> Synthetic 
<400> 493 

ggaaggaggg ggcctcggcg tcaaagacca cgat 34 



<210> 494 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 494 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 
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Val Phe Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 
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Leu Asp Leu Val Pro 
370 

Asp Pro Ser Asn Thr 
385 

Glu Trp Thr Glu Asp 
405 

His Arg Asn Leu Leu 
420 

Leu Tyr His Glu Val 
435 

Glu Ala Thr Gly Val 
450 

Leu Glu Leu Ala Glu 
465 

Leu Ala Gly His Pro 
485 

Val Leu Phe Asp Glu 
500 

Thr Gly Lys Arg Ser 
515 

Ala His Pro He Val 
530 

Leu Lys Asn Thr Tyr 
545 

Thr Gly Arg Leu His 
565 

Arg Leu Ser Ser Ser 
580 

Pro Leu Gly Gin Arg 
595 

Ala Leu Val Ala Leu 
610 

His Leu Ser Gly Asp 
625 

Asp He His Thr Gin 
645 

Ala Val Asp Pro Leu 
660 



Gly Asp Asp Pro Met Leu 
375 

Thr Pro Glu Gly Val Ala 
390 395 

Ala Ala His Arg Ala Leu 
410 

Lys Arg Leu Glu Gly Glu 
425 

Glu Lys Pro Leu Ser Arg 
440 

Arg Arg Asp Val Ala Tyr 
455 

Glu lie Arg Arg Leu Glu 
470 475 

Phe Asn Leu Asn Ser Arg 
490 

Leu Arg Leu Pro Ala Leu 
505 

Thr Ser Ala Ala Val Leu 
520 

Glu Lys He Leu Gin His 
535 

Val Asp Pro Leu Pro Ser 
550 555 

Thr Arg Phe Asn Gin Thr 
570 

Asp Pro Asn Leu Gin Asn 
585 

He Arg Arg Ala Phe Val 
600 

Asp Tyr Ser Gin He Glu 
615 

Glu Asn Leu He Arg Val 
630 635 

Thr Ala Ser Trp Met Phe 
650 

Met Arg Arg Ala Ala Lys 
665 



Leu Ala Tyr Leu Leu 
380 

Arg Arg Tyr Gly Gly 
400 

Leu Ser Glu Arg Leu 

415 

Glu Lys Leu Leu Trp 
430 

Val Leu Ala His Met 
445 

Leu Gin Ala Leu Ser 
460 

Glu Glu Val Phe Arg 
480 

Asp Gin Leu Glu Arg 
495 

Lys Lys Thr Lys Lys 
510 

Glu Ala Leu Arg Glu 
525 

Arg Glu Leu Thr Lys 
540 

Leu Val His Pro Arg 
560 

Ala Thr Ala Thr Gly 
575 

He Pro Val Arg Thr 
590 

Ala Glu Ala Gly Trp 
605 

Leu Arg Val Leu Ala 
620 

Phe Gin Glu Gly Lys 
640 

Gly Val Pro Pro Glu 
655 

Thr Val Asn Phe Gly 
670 
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Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Plie Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 495 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 495 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 30 0 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
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accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 
gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 
gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 
acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 
cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 
caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 
ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 496 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 496 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 

450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 VOO 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 

785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 

805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 



Gly His His His His His His 
835 

<210> 497 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 497 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 120 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacgag 24 0 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 3 00 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 360 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 420 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 

ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 720 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 7 80 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 84 0 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 10 8 0 
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gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cc cgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 498 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 498 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 



Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 499 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 499 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 
gtgatcgtgg tctttgacgc cgaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 
ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 
taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 
aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 
gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 
ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 
cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 
ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 
ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 
cttaacgcct tggccgccgc ctggggcggc cgcgtgcacc gggcagcaga ccccttggcg 
gggctaaagg acctcaagga ggtccggggc ctcctcgcca aggacctcgc cgtcttggcc 
tcgagggagg ggctagacct cgtgcccggg gacgacccca tgctcctcgc ctacctcctg 
gacccttcga acaccacccc cgagggggtg gcgcggcgct acggggggga gtggacggag 
gacgccgccc accgggccct cctctcggag aggctccatc ggaacctcct taagcgcctc 
gagggggagg agaagctcct ttggctctac cacgaggtgg aaaagcccct ctcccgggtc 132 0 
ctggcccata tggaggccac cggggtacgg cgggacgtgg cctaccttca ggccctttcc 
ctggagcttg cggaggagat ccgccgcctc gaggaggagg tcttccgctt ggcgggccac 
cccttcaacc tcaactcccg ggaccagctg gaaagggtgc tctttgacga gcttaggctt 
cccgccttga agaagacgaa gaagacaggc aagcgctcca ccagcgccgc ggtgctggag 
gccctacggg aggcccaccc catcgtggag aagatcctcc agcaccggga gctcaccaag 
ctcaagaaca cctacgtgga ccccctccca agcctcgtcc acccgaggac gggccgcctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1380 
1440 
1500 
1560 
1620 
1680 
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cacacccgct 


tcaaccagac 


ggccacggcc 


acggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgcagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


qaqqcqqqtt 


qqqcqttqqt 


qqccctqqac 


tatagccaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggscgaaaa 


cctgatcagg 


gtcttccagg 


^gggg^^gg^ 


catccacacc 


1920 


cagaccgcaa 


qctQqatqtt 


cggcgtcccc 


ccqqaaqccq 


tqqaccccct 


OatQCQCCQQ 

ZJ Zi Zi ZiZJ 


1980 


gcggccaaga 


cggtgaactt 


cggcgtcctc 


tacggcatgt 


ccgcccatag 


gctctcccag 


2040 


gagcttgcca 


tcccctacga 


aaaaacaatcr 


gcctttatag 


^gcgctactt 


ccaaagcttc 


2100 


cccaaggtgc 


qqqcctqqat 


agaaaagacc 


ctqqaqqaqq 


qqaqqaaqcq 


qqqctacqtq 

Zi^Zj^^^^Zi ZD 


2160 


gaaaccctct 


tcqqaaqaaq 


qcqctacqtq 


cccgacctca 


acqcccqqqt 


gaaqagcqtc 


2220 


aqqqaqqccg 


cqqaqcqcat 


ggccttcaac 


atgcccgtcc 


agggcaccgc 

^ Zl Z} J 


cgccgacctc 


2280 


atgaagctcg 


ccatggtgaa 


gctcttcccc 


cgcctccggg 


agatgggggc 


ccgcatgctc 


2340 


ctccaggtcg 


ccaacgagct 


cctcctggag 


gccccccaag 


cgcgggccga 


ggaggtggcg 


2400 


gctttggcca 


aggaggccat 


ggagaaggcc 


tatcccctcg 


ccgtgcccct 


ggaggtggag 


2460 


gtggggatgg 


gggagg^-ctg 


gctttccgcc 


aagggtcacc 


accaccacca 


ccac 


2514 



<210> 500 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 500 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 
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Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 



Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 
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Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 



Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 
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Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 

<210> 501 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 501 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 24 0 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
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aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 7 80 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 12 00 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 13 2 0 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 13 8 0 

tccctggagc ttgcggagga gat ccgccgc ctcgaggagg aggtcttccg cttggcgggc 144 0 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 156 0 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 162 0 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 192 0 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctct cc 2 040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 222 0 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 234 0 
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2517 



ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 
gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 



<210> 502 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 502 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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515 



520 



525 



He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
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820 825 830 

Gly His His His His His His 
835 

<210> 503 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 503 

atcgtggtct ttgacgccga ggccccctcc ttcc 



<210> 504 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 504 

ggaaggaggg ggcctcggcg tcaaagacca cgat 



<210> 505 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 505 

atgaattcgg aggcgatgct gcccctcttt gagcccaagg gccgggtcct cctggtggac 



34 



34 



60 
120 
180 



ggccaccacc tggcctaccg caccttccac gccctgaagg gcctcaccac cagccggggg 
gagccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct caaggaggac 

ggggacgcgg tgatcgtggt ctttgacgcc gaggccccct ccttccgcca cgaggcctac 24 0 

ggggggtaca aggcgggccg ggcccccacg ccggaggact ttccccggca actcgccctc 300 

atcaaggagc tggtggacct cctggggttc acgcgcctcg aggtcccggg ctacgaggcg 3 60 

gacgacgtcc tggccagcct ggccaagaag gcggaaaagg agggctacga ggtccgcatc 420 

ctcaccgccg acaaagacct ttaccagctc ctttccgacc gcatccacgt cctccacccc 480 

gaggggtacc tcatcacccc ggcctggctt tgggaaaagt acggcctgag gcccgaccag 540 

tgggccgact accgggccct gaccggggac gagtccgaca accttcccgg ggtcaagggc 600 
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atcggggaga agacggcgag gaagcttctg gaggagtggg ggagcctgga agccctcctc 660 

aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 72 0 

ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagagg cttagggcct ttctggagag gcttgagttt 84 0 

ggcagcctcc tccacgagtt cggccttctg gaaagcccca aggccctgga ggaggccccc 900 

tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 960 

gccgatcttc tggccctggc cgccgccagg ggcggccgcg tgcaccgggc agcagacccc 102 0 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 12 00 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 132 0 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 13 8 0 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 144 0 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 168 0 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 192 0 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2 04 0 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 
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gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2 520 



<210> 506 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 506 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys 
195 200 205 
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Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 
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hys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 
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Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 





<210> 


507 




<211> 


36 




<212> 


DNA 




<213> 


Artificial 




<220> 






<223> 


Synthetic 




<400> 


507 




cagaccatga attcggaggc 




<210> 


508 




<211> 


36 




<212> 


DNA 




<213> 


Artificial 




<220> 






<223> 


Synthetic 










<400> 


508 




aaagaggggc agcatcgcct 




<210> 


509 




<211> 


2517 




<212> 


DNA 




<213> 


Artificial 



<220> 

<223> Synthetic 
<400> 509 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



36 



36 



60 
120 



caccacctgg cctaccgcac cttctttgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 



240 
300 



480 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
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gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 84 0 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 90 0 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

m ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 12 00 

^ gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 12 60 

2: ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 13 2 0 

- gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 13 8 0 

|f1 tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 144 0 

p caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 150 0 

cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 162 0 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 168 0 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 174 0 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 1920 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctaa 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 222 0 
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2517 



gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 



<210> 510 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 510 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 

485 490 495 
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Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 
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Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 511 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 511 

gcctaccgca ccttctttgc cctgaagggc etc 3 3 



<210> 512 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 512 

gaggcccttc agggcaaaga aggtgcggta ggc 33 



<210> 513 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 513 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcag agaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
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gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 
gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 
gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 
acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 
cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 
caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 
ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



1680 
1740 
1800 
1860 
1920 
1980 
2040 



2160 
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gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 24 00 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 514 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 514 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu- Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
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180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
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485 



490 



495 



Asp Glu Leu Arg Leu Pro Ala Leu 
500 

Arg Ser Thr Ser Ala Ala Val Leu 
515 520 



Lys Lys Thr Lys Lys Thr Gly Lys 
505 510 

Glu Ala Leu Arg Glu Ala His Pro 
525 



lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 



Leu His Thr Arg 



Ser Ser Asp Pro 
580 

Gin Arg lie Arg 
595 

Ala Leu Asp Tyr 
610 

Gly Asp Glu Asn 
625 

Thr Gin Thr Ala 



Phe Asn Gin Thr 
565 

Asn Leu Gin Asn 



Arg Ala Phe Val 
600 

Ser Gin lie Glu 
615 

Leu lie Arg Val 
630 

Ser Trp Met Phe 
645 



Ala Thr Ala Thr 
570 

lie Pro Val Arg 
585 

Ala Glu Ala Gly 



Leu Arg Val Leu 
620 

Phe Gin Glu Gly 
635 

Gly Val Pro Pro 
650 



Gly Arg Leu Ser 
575 

Thr Pro Leu Gly 
590 

Trp Ala Leu Val 
605 

Ala His Leu Ser 



Lys Asp lie His 
640 

Glu Ala Val Asp 
655 



Pro Leu Met Arg 
660 

Gly Met Ser Ala 
675 

Glu Ala Val Ala 
690 

Arg Ala Trp lie 
705 

Val Glu Thr Leu 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 



Arg Ala Ala Lys 



His Arg Leu Ser 
680 

Phe lie Glu Arg 
695 

Glu Lys Thr Leu 
710 

Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Thr Ala Ala Asp 
760 



Thr Val Asn Phe 
665 

Gin Glu Leu Ala 



Tyr Phe Gin Ser 
700 

Glu Glu Gly Arg 
715 

Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Val Leu Tyr 
670 

lie Pro Tyr Glu 
685 

Phe Pro Lys Val 



Lys Arg Gly Tyr 
720 

Asp Leu Asn Ala 
735 

Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 



Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 



Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
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785 



790 795 800 



Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 



<210> 


515 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


515 


ctcctcaagg ccctcagaga 


<210> 


516 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


516 


cgcgtccccg tcctctctga 


<210> 


517 


<211> 


2517 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


517 



33 



33 



60 



atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccaccctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 
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accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 
gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 
gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 
acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 
cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 
caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 



480 
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600 
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720 
780 
840 
900 
960 
1020 
1080 
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1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 222 0 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 24 60 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 



<210> 518 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 518 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 519 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 519 

gacgacgtcc tggccaccct ggccaagaag gcg 





<210> 


520 




<211> 


33 




<212> 


DNA 




<213> 


Artificial 


^iT ;; 


<220> 






<223> 


Synthetic 




<400> 


520 




cgccttcttg gccagggtgg 




<210> 


521 




<211> 


2517 




<212> 


DNA 




<213> 


Artificial 




<220> 






<223> 


Synthetic 




<400> 


521 



33 



33 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 
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gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 




V >— • y V C4. X— V- ^ 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


cccacgt. ed- 


\^ d. O ^ y cLy 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


ge ctgaggcc 


cy dcody f-yy 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


tiucccggggc 


cddy y y ed 


600 


ggggagaaga 


cggcgctcaa 


gcttctggag 


gagtggggga 


gcc L.y y ddy u 


l^V^V^ L-»^Cl c-y 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


egg ccc-d-Cci l 


yydoyciu.^-^ i-y 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgaccugc 


/-» r-i /-1 1— /^r~f a (TCtI" 

ccccyyctyy l. 


yycio I— ^y^-v-* 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


T^ggs^g^ggc*- 


L-ydy L. yy 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


yy cet-ce uyy 


90 0 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


cax,g Lyyy ee 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agacccc ttg 


X u ^ u 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccguc cuy 


J_ V O V 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcc uacc L-u 


114 0 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


19 0 0 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1 9 ^ O 
± Z D U 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctct cccgg 


1 "5 9 0 
X -5 Z U 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacc t 


L (_ dy y e *— L- L- 


13 80 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cccggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcu cagg 


1 RO n 


cttcccgcct 


tgaagaagac 


gaagaagaca ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


T IT r\ 

lb o \J 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


162 0 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


T Q O 

lb o U 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


174 0 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


180 0 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


agg^gggg^^ 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 
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caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 228 0 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 24 60 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 



<210> 522 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 522 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 
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Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 IVO 1V5 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 
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Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 
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Leu Phe Pro Arg Leu Arg Glu Net Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 523 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 523 

ggggagaaga cggcgctcaa gcttctggag gag 



33 



<210> 524 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 524 

ctcctccaga agcttgagcg ccgtcttctc ccc 33 



<210> 525 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 525 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 



977 



aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 42 0 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 66 0 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 72 0 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggaggggctt aaggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 12 0 0 

g^ggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 12 60 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 144 0 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 162 0 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 1920 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 
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cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga gctcctcctg gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



^ <210> 526 

<211> 839 

W <212> PRT 

ffl <213> Artificial 

m <220> 

<223> Synthetic 

<400> 526 

p Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 



£ Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

A 20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 VO V5 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
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145 



150 155 160 



Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
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450 



455 460 



Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
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755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 527 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 527 

gagcccgacc gggaggggct taaggccttt ctggagagg 3 9 



<210> 528 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 528 

cctctccaga aaggccttaa gcccctcccg gtcgggctc 3 9 



<210> 529 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 529 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 24 0 
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gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggga ggggagaagc cccgggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 13 8 0 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 
gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 
gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 



420 
480 
540 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1440 
1500 
1560 
1620 
1680 



1800 
1860 
1920 
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acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 240 0 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 24 60 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 530 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 530 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 
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Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro Glu 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Arg Asp 
450 

Ala Glu Glu lie Arg 
465 

His Pro Phe Asn Leu 
485 



Val Ala Tyr Leu Gin Ala 
455 

Arg Leu Glu Glu Glu Val 
470 475 

Asn Ser Arg Asp Gin Leu 
490 



Leu Ser Leu Glu Leu 
460 

Phe Arg Leu Ala Gly 
480 

Glu Arg Val Leu Pile 
495 



Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 



lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 

530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 

545 550 555 560 



Leu His Thr Arg 

Ser Ser Asp Pro 
580 



Phe Asn Gin Thr 
565 

Asn Leu Gin Asn 



Ala Thr Ala Thr 
570 

lie Pro Val Arg 
585 



Gly Arg Leu Ser 
575 

Thr Pro Leu Gly 
590 



Gin Arg lie Arg Arg 
595 

Ala Leu Asp Tyr Ser 
610 

Gly Asp Glu Asn Leu 
625 



Ala Phe Val Ala Glu Ala 
600 

Gin lie Glu Leu Arg Val 
615 

lie Arg Val Phe Gin Glu 
630 635 



Gly Trp Ala Leu Val 
605 

Leu Ala His Leu Ser 
620 

Gly Lys Asp lie His 
640 



Thr Gin Thr Ala 



Pro Leu Met Arg 
660 

Gly Met Ser Ala 
675 

Glu Ala Val Ala 
690 

Arg Ala Trp lie 
705 

Val Glu Thr Leu 



Arg Val Lys Ser 
740 



Ser Trp Met Phe 
645 

Arg Ala Ala Lys 



His Arg Leu Ser 
680 

Phe lie Glu Arg 
695 

Glu Lys Thr Leu 
710 

Phe Gly Arg Arg 
725 

Val Arg Glu Ala 



Gly Val Pro Pro 
650 

Thr Val Asn Phe 
665 

Gin Glu Leu Ala 



Tyr Phe Gin Ser 
700 

Glu Glu Gly Arg 
715 

Arg Tyr Val Pro 
730 

Ala Glu Arg Met 
745 



Glu Ala Val Asp 
655 

Gly Val Leu Tyr 
670 

lie Pro Tyr Glu 
685 

Phe Pro Lys Val 



Lys Arg Gly Tyr 
720 

Asp Leu Asn Ala 
735 

Ala Phe Asn Met 
750 
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Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 531 

<211> 54 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 531 

cacgagttcg gccttctggg aggggagaag ccccgggagg aggccccctg gccc 54 



<210> 532 

<211> 54 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 532 

gggccagggg gcctcctccc ggggcttctc ccctcccaga aggccgaact cgtg 54 



<210> 533 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 533 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 18 0 
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gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 3 00 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 4 80 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

gggg^9^^9^ cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc ctgcaggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 10 8 0 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 114 0 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 13 80 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 174 0 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 
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gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 192 0 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 8 0 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2 34 0 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 24 60 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2 517 



<210> 534 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 534 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 
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Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 



Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 
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Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 
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Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 535 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 535 

ctggccctgg ccgcctgcag gggcggccgc gtg 33 



<210> 536 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 536 

cacgcggccg cccctgcagg cggccagggc cag 3 3 



<210> 537 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 537 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 6 0 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 12 0 
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ccggtgcagg 


cggtctacgg 


cttcgccaag agcctcctca aggccctcaa 


ggaggacggg 


1 fiO 

J. O U 


gacgcggtga 


tcgtggtctt 


tgacgccgag gccccctcct tccgccacga ggcctacggg 


9 A n 

Z VJ 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc cccggcaact 


cgccctcatc 


ion 

J u u 


aaggagctgg 


tggacctcct 


ggggttcacg cgcctcgagg tcccgggcta 


cgaggcggac 


O D U 


gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 


J 1 w 


gccgactacc 


gggccctgac 


/^fTrrrrrra r<np rr f" non^C^PiC^C 1" tCCCCfOQCft 

^yyyy^ y y y ^ ^^^^ 


caaqqacatc 


n 0 

u w w 


ggggagaaga 


cggcgaggaa 


rTr»i- -1- r't-rra ii^rt n^nt OCSOCSCSR^ QCCtCTQaaCfC 


cctcctcaag 


o o u 


aacctggacc 


ggctgaagcc 


Cy ooct L.O\_-yy yciyciciyciu^v- »^y^»w.v_-v-.oit-.*^»- 


QQacQatctq 


72 0 


aagctctcct 


gggacctggc 


caaggcgcgc accya-ct^uyu ^-^o^^uyycxyy u 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggccc agggccuuuc Lyyciyo-yyuu 


tgs-gtttggc 


O rt U 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg ccctggagga 


yyi_i>^*^k_-v_. yy 


Q o n 

J U U 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgcctuccc gcaaggagcc 


r' a i" rr t~ cf era P 
v_.ctL-y yyy^^ 


Q D 
you 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtgc accgggcagc 


d.^^ d \^ \^ <^ ^ ^ ^ ^ 


1 n o n 
X U u 


gcggggctaa 


aggacctcaa 


gg^ggt<^^gg ggcctcctcg ccaaggacct 


t^yi,^\^y L-^ L-y 


1 n A fi 

X u o u 


gcctcgaggg 


aggggctaga 


cctcgtgccc ggggacgacc ccatgctcct 


cgcctacctc 




ctggaccctt 


cgaacaccac 


ccccgagggg gtggcgcggc gctacggggg 


ggagtggacg 




gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc atcggaacct 


ccttaagcgc 




ctcgaggggg 


aggagaagct 


cctttggctc taccacgagg tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta cggcgggacg tggcctacct 


tcaggccctt 


13 8 0 


tccctggagc 


ttgcggagga 


gatccgccgc ctcgaggagg aggtcttccg 


cttggcgggc 


144 U 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca ggcaagcgct ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc ccaagcctcg tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc ttgggccaga ggatccgccg 


ggccttcgtg 


1800 
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gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 192 0 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2 04 0 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2 220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 22 80 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 24 00 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 246 0 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 538 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 538 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
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115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
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420 



425 



430 



Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
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725 



730 



735 



Arg Val Lys Ser 
740 

Pro Val Gin Gly 
755 

Leu Phe Pro Arg 
770 

Ala Asn Glu Leu 
785 

Ala Ala Leu Ala 



Pro Leu Glu Val 
820 



Val Arg Glu Ala 



Thr Ala Ala Asp 
760 

Leu Arg Glu Met 
775 

Leu Leu Glu Ala 
790 

Lys Glu Ala Met 
805 

Glu Val Gly Met 



Ala Glu Arg Met 
745 

Leu Met Lys Leu 



Gly Ala Arg Met 
780 

Pro Gin Ala Arg 
795 

Glu Lys Ala Tyr 
810 

Gly Glu Asp Trp 
825 



Ala Phe Asn Met 
750 

Ala Met Val Lys 
765 

Leu Leu Gin Val 



Ala Glu Glu Val 
800 

Pro Leu Ala Val 
815 

Leu Ser Ala Lys 
830 



Gly His His His His His His 
835 



<210> 539 

<211> 42 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 539 

ggggagaaga cggcgaggaa gcttctgaag gagtggggga gc 



<210> 540 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 540 

gctcccccac tccttcagaa gcttcctcgc cgtcttctcc cc 



<210> 541 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 541 

atgaattcgg aggcgatgct gcccctcttt gagcccaagg gccgggtcct cctggtggac 
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ggccaccacc tggcctaccg caccttcttt gccctgaagg gcctcaccac cagccggggg 120 

gagccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct cagagaggac 180 

^gggscgcgg tgatcgtggt ctttgacgcc gaggccccct ccttccgcca cgaggcctac 24 0 

ggggggtaca aggcgggccg ggcccccacg ccggaggact ttccccggca actcgccctc 3 00 

atcaaggagc tggtggacct cctggggttc acgcgcctcg aggtcccggg ctacgaggcg 360 

gacgacgtcc tggccaccct ggccaagaag gcggaaaagg agggctacga ggtccgcatc 420 

ctcaccgccg acaaagacct ttaccagctc ctttccgacc gcatccacgt cctccacccc 480 

9aggggtacc tcatcacccc ggcctggctt tgggaaaagt acggcctgag gcccgaccag 540 

tgggccgact accgggccct gaccggggac gagtccgaca accttcccgg ggtcaagggc 600 

atcggggaga agacggcgct caagcttctg gaggagtggg ggagcctgga agccctcctc 660 

aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 72 0 

ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagggg cttaaggcct ttctggagag gcttgagttt 84 0 

ggcagcctcc tccacgagtt cggccttctg ggaggggaga agccccggga ggaggccccc 90 0 

tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 960 

gccgatcttc tggccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 1020 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 12 60 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 13 80 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 144 0 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccsLCcccatc gtggagaaga tcctccagca ccgggagctc 162 0 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 
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cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 192 0 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2 04 0 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 22 80 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 252 0 



<210> 542 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 542 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 
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Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 
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Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 



His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540, 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 



Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 
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Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 543 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 543 



atgaattcgg 


aggcgatgct 


gcccctcttt 


gagcccaagg 


gccgggtcct 


cctggtggac 


ggccaccacc 


tggcctaccg 


caccttcttt 


gccctgaagg 


gcctcaccac 


cagccggggg 


gagccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


cagagaggac 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


gaggccccct 


ccttccgcca 


cgaggcctac 


ggggggtaca 


aggcgggccg 


ggcccccacg 


ccggaggact 


ttccccggca 


actcgccctc 


atcaaggagc 


tggtggacct 


cctggggttc 


acgcgcctcg 


aggtcccggg 


ctacgaggcg 


gacgacgtcc 


tggccaccct 


ggccaagaag 


gcggaaaagg 


agggctacga 


ggtccgcatc 


ctcaccgccg 


acaaagacct 


ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


gaggggtacc 


tcatcacccc 


ggcctggctt 


tgggaaaagt 


acggcctgag 


gcccgaccag 


tgggccgact 


accgggccct 


gaccggggac 


gagtccgaca 


accttcccgg 


ggtcaagggc 


atcggggaga 


agacggcgct 


caagcttctg 


aaggagtggg 


ggagcctgga 


agccctcctc 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



1002 



ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 7 80 

gccaaaaggc gggagcccga ccgggagggg cttaaggcct ttctggagag gcttgagttt 840 

ggcagcctcc tccacgagtt cggccttctg ggaggggaga agccccggga ggaggccccc 90 0 

tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 960 

gccgatcttc tggccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 102 0 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 10 80 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 12 0 0 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 13 8 0 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 144 0 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 162 0 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 174 0 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 192 0 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 198 0 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 
aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 234 0 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 24 0 0 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 



1003 



gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 544 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 544 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 



1004 



225 



230 



235 



240 



Leu Lys Leu Ser Trp 
245 

Glu Val Asp Phe Ala 
260 

Ala Phe Leu Glu Arg 
275 

Leu Leu Gly Gly Glu 
290 

Glu Gly Ala Phe Val 
305 

Ala Asp Leu Leu Ala 
325 

Ala Ala Asp Pro Leu 
340 

Leu Leu Ala Lys Asp 
355 

Leu Val Pro Gly Asp 
370 

Ser Asn Thr Thr Pro 
385 

Thr Glu Asp Ala Ala 
405 

Asn Leu Leu Lys Arg 
420 

His Glu Val Glu Lys 
435 

Thr Gly Val Arg Arg 
450 

Leu Ala Glu Glu lie 
465 

Gly His Pro Phe Asn 
485 

Phe Asp Glu Leu Arg 
500 

Lys Arg Ser Thr Ser 
515 

Pro lie Val Glu Lys 



Asp Leu Ala Lys Val Arg 
250 

Lys Arg Arg Glu Pro Asp 
265 

Leu Glu Phe Gly Ser Leu 
280 

Lys Pro Arg Glu Glu Ala 
295 

Gly Phe Val Leu Ser Arg 
310 315 

Leu Ala Ala Cys Arg Gly 
330 

Ala Gly Leu Lys Asp Leu 
345 

Leu Ala Val Leu Ala Ser 
360 

Asp Pro Met Leu Leu Ala 
375 

Glu Gly Val Ala Arg Arg 
390 395 

His Arg Ala Leu Leu Ser 
410 

Leu Glu Gly Glu Glu Lys 
425 

Pro Leu Ser Arg Val Leu 
440 

Asp Val Ala Tyr Leu Gin 
455 

Arg Arg Leu Glu Glu Glu 
470 475 

Leu Asn Ser Arg Asp Gin 
490 

Leu Pro Ala Leu Lys Lys 
505 

Ala Ala Val Leu Glu Ala 
520 

lie Leu Gin His Arg Glu 



Thr Asp Leu Pro Leu 
255 

Arg Glu Gly Leu Lys 
270 

Leu His Glu Phe Gly 
285 

Pro Trp Pro Pro Pro 
300 

Lys Glu Pro Met Trp 
320 

Gly Arg Val His Arg 
335 

Lys Glu Val Arg Gly 
350 

Arg Glu Gly Leu Asp 
365 

Tyr Leu Leu Asp Pro 
380 

Tyr Gly Gly Glu Trp 
400 

Glu Arg Leu His Arg 
415 

Leu Leu Trp Leu Tyr 
430 

Ala His Met Glu Ala 
445 

Ala Leu Ser Leu Glu 
460 

Val Phe Arg Leu Ala 
480 

Leu Glu Arg Val Leu 
495 

Thr Lys Lys Thr Gly 
510 

Leu Arg Glu Ala His 
525 

Leu Thr Lys Leu Lys 



1005 



530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 



1006 



835 



840 



<210> 545 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 545 

ggggagaaga cggcgctcag gcttctgaag gagtggggga gcctggaagc 



<210> 546 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 546 

gcttccaggc tcccccactc cttcagaagc ttgagcgccg tcttctcccc 



<210> 547 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 547 

ctggtcggga cggacgccaa tgagggtgtg aag 



<210> 548 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 548 
gtccgtcccg accagaat 



<210> 549 

<211> 1008 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



1007 



<400> 549 





3.tgggtgcgg 


atattggtga 


cctctttgag 




tcgagcttga 


gtacttctca 


60 




ggaaagaaaa 


ttgccgttga 


tgctttcaac 


acgctatacc 


agttcatctc 


gataataagg 


120 




cagcctgacg 


gtacgccgtt 


aaaggactca 


cagggcagaa 


tcacctctca 


cctttccgga 


180 




atcctataca 


gagtctccaa 


catggtcgag 


gtgggaatca 


ggccggtgtt 


tgtattcgac 


240 




ggagagccac 


cggagttcaa 


gaaggctgaa 


attgaggaga 


ggaaaaagag 


^^999ctgag 


300 




gcagaggaga 


tgtggattgc 


ggctttgcag 


gcaggagata 


aggacgcgaa 


aaagtatgct 


360 




caggctgcag 


ggagggttga 


cgagtacatt 


gttgactccg 


caaagacgct 


tttaagttac 


420 




atggggattc 


cctttgtcga 


tgccccgtct 


gaaggagagg 


cgcaggctgc 


ttacatggca 


480 


■£1 


gcaaaaggcg 


atgtggagta 


cacaggaagc 


caggattacg 


attctctgct 


cttcggaagc 


540 




ccgagactcg 


ccagaaatct 


cgcaataacg 


ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 




tatgtggatg 


taaagccgga 


gataataatt 


ctqqaaagca 


acctcaaaag 


gctgggtttg 


660 


m 


acqaqqqaqc 


agctcatcga 


catagcgatt 


ctqqtcqqqa 


cggacgccaa 


tqaqqqtqtq 


72 0 




aagggtgtcg 


gcgtcaagaa 


ggctttgaac 


tacatcaaga 


cctacggaga 


tattttcagg 


780 




gcactcaagg 


ctctgaaagt 


aaatattgac 


cacgtagagg 


agataaggaa 


tttcttcctg 


840 




aatcctcctg 


tgactgacga 


ctacagaata 


gagttcaggg 


agcctgactt 


tgagaaggcc 


900 




atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 




aagctcaaag 


ctctgaagtc 


aacccaggcc 


acgcttgaga 


ggtggttc 




1008 



<210> 550 

<211> 336 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 550 

Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 

1008 



50 55 60 

Val Ser Asn Met Val Glu Val Gly lie Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu lie Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Ala Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

<210> 551 
<211> 33 
<212> DNA 
<213> Artificial 



1009 



<220> 

<223> Synthetic 

<400> 551 

ctggtcggga cggacaggaa tgagggtgtg aag 3 3 

<210> 552 

<211> 1008 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 552 



atgggtgcgg 


atattggtga 


cctctttgag 




tcgagcttga 


gtacttctca 


60 


ggaaagaaaa 


ttgccgttga 


tgctttcaac 


acgctatacc 


agttcatctc 


gataataagg 


120 


cagcctgacg 


gtacgccgtt 


aaaggactca 


cagggcagaa 


tcacctctca 


cctttccgga 


180 


atcctataca 


gagtctccaa 


catggtcgag 


gtgggaatca 


ggccggtgtt 


tgtattcgac 


240 


ggagagccac 


cggagttcaa 


gaaggctgaa 


attgaggaga 


ggaaaaagag 


3-agggctgag 


300 


gcagaggaga 


tgtggattgc 


ggctttgcag 


gcaggagata 


aggacgcgaa 


aaagtatgct 


360 


caggctgcag 


9g^g9gttga 


cgagtacatt 


gttgactccg 


caaagacgct 


tttaagttac 


420 


atggggattc 


cctttgtcga 


tgccccgtct 


gaaggagagg 


cgcaggctgc 


ttacatggca 


480 


gcaaaaggcg 


atgtggagta 


cacaggaagc 


caggattacg 


attctctgct 


cttcggaagc 


540 


ccgagactcg 


ccagaaatct 


cgcaataacg 


ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt 


ctggaaagca 


acctcaaaag 


gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt 


ctggtcggga 


cggacaggaa 


tgagggtgtg 


720 


^^ggg^stcg 


gcgtcaagaa 


ggctttgaac 


tacatcaaga 


cctacggaga 


tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac 


cacgtagagg 


agataaggaa 


tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata 


gagttcaggg 


agcctgactt 


tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aacccaggcc 


acgcttgaga 


sgtggttc 




1008 



<210> 553 

<211> 336 

<212> PRT 

<213> Artificial 



1010 



<220> 

<223> Synthetic 



<400> 553 

Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Arg Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 



1011 



275 



280 



285 



Arg lie Glu Phe 
290 



Arg Glu Pro Asp Phe Glu Lys Ala lie Glu Phe Leu 
295 300 



Cys Glu Glu His 
305 



Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
310 315 320 



Lys Leu Lys Ala 



Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 



<210> 554 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 554 

gaaccacctc tcaagcgtgg 2 0 



<210> 555 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 555 

acgcttgaga ggtggttcct ggaggaggcc ccctgg 3 6 

<210> 556 
<211> 1011 
<212> DNA 

<213> Archaeoglobus fulgldus 
<400> 556 

^tgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 
ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 12 0 
cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 
atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 
ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 300 
gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 3 60 
caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 42 0 
^tggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 



1012 



gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 



540 



ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 



600 



tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 



660 



acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 



720 



sa999tgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 



780 



gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 



840 



aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 



900 



atcgagttcc tgtgcgagga gcacgacttc agcagggaga gggtcgagaa ggccttggag 960 



<210> 557 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 557 

taatctgtat caggctg 17 



<210> 558 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 558 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 24 0 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 



aagctcaaag ctctgaagtc aacccaggcc acgcttgaga ggtggttctg a 



1011 



1013 



cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


ggg^cgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


g^gtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 



1014 



gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 
caccac 

<210> 559 

<211> 2643 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 559 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 
ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 
cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 
atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 
ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 
gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 
caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 
atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 
gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 
ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 
tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 
acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 
aagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 
gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 
aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 
atcgagttcc tgtgcgagga gcacgacttc agcagggaga gggtcgagaa ggccttggag 
aagctcaaag ctctgaagtc aacccaggcc acgcttgaga ggtggttcct ggaggaggcc 
ccctggcccc cgccggaagg ggccttcgtg ggcttcgtcc tctcccgccc cgagcccatg 

1015 



tgggcggagc 


ttaaagccct 


ggccgcctgc 


aggggcggcc 


gcgtgcaccg 


ggcagcagac 


1140 


cccttggcgg 


ggctaaagga 


cctcaaggag 


gtccggggcc 


tcctcgccaa 


ggacctcgcc 


1200 


gtcttggcct 


cgagggaggg 


gctagacctc 


gtgcccgggg 


acgaccccat 


gctcctcgcc 


1260 


tacctcctgg 


acccttcgaa 


caccaccccc 


gagggggtgg 


cgcggcgcta 


cgggggggag 


1320 


tggacggagg 


acgccgccca 


ccgggccctc 


ctctcggaga 


ggctccatcg 


gaacctcctt 


1380 


aagcgcctcg 


agggggagga 


gaagctcctt 


tggctctacc 


acgaggtgga 


aaagcccctc 


1440 


tcccgggtcc 


tggcccatat 


ggsggccacc 


ggggtacggc 


gggacgtggc 


ctaccttcag 


1500 


gccctttccc 


tggagcttgc 


ggaggagatc 


cgccgcctcg 


aggaggaggt 


cttccgcttg 


1560 


gcgggccacc 


ccttcaacct 


caactcccgg 


gaccagctgg 


aaagggtgct 


ctttgacgag 


1620 


cttaggcttc 


ccgccttgaa 


gaagacgaag 


aagacaggca 


agcgctccac 


cagcgccgcg 


1680 


gtgctggagg 


ccctacggga 


ggcccacccc 


atcgtggaga 


agatcctcca 


gcaccgggag 


1740 


ctcaccaagc 


tcaagaacac 


ctacgtggac 


cccctcccaa 


gcctcgtcca 


cccgaggacg 


1800 


ggccgcctcc 


acacccgctt 


caaccagacg 


gccacggcca 


cggggaggct 


tagtagctcc 


1860 


gaccccaacc 


tgcagaacat 


ccccgtccgc 


acccccttgg 


gccagaggat 


ccgccgggcc 


1920 


ttcgtggccg 


aggcgggttg 


ggcgttggtg 


gccctggact 


atagccagat 


agagctccgc 


1980 


gtcctcgccc 


acctctccgg 


ggacgaaaac 


ctgatcaggg 


tcttccagga 


ggggaaggac 


2040 


atccacaccc 


agaccgcaag 


ctggatgttc 


ggcgtccccc 


cggaggccgt 


ggaccccctg 


2100 


atgcgccggg 


cggccaagac 


ggtgaacttc 


ggcgtcctct 


acggcatgtc 


cgcccatagg 


2160 


ctctcccagg 


agcttgccat 


cccctacgag 


gaggcggtgg 


cctttataga 


gcgctacttc 


2220 


caaagcttcc 


ccaaggtgcg 


ggcctggata 


gaaaagaccc 


tggaggaggg 


gaggaagcgg 


2280 


ggctacgtgg 


aaaccctctt 


cggaagaagg 


cgctacgtgc 


ccgacctcaa 


cgcccgggtg 


2340 


aagagcgtca 


gggaggccgc 


ggagcgcatg 


gccttcaaca 


tgcccgtcca 


gggcaccgcc 


2400 


gccgacctca 


tgaagctcgc 


catqgtqaaq 


ctcttccccc 


qCCtCCqqqa 


qatqqqqqcc 


2460 


cgcatgctcc 


tccaggtcgc 


caacgagctc 


ctcctggagg 


ccccccaagc 


gcgggccgag 


2520 


gaggtggcgg 


ctttggccaa 


ggaggccatg 


gagaaggcct 


atcccctcgc 


cgtgcccctg 


2580 


gaggtggagg 


tggggatggg 


ggaggactgg 


ctttccgcca 


agggtcacca 


ccaccaccac 


2640 


cac 












2643 



<210> 560 



1016 



<211> 881 
<212> PRT 
<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 560 

Met Gly Ala Asp lie Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 



1017 



Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 

Ala Cys Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
370 375 380 

Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
385 390 395 400 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
435 440 445 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
450 455 460 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
465 470 475 480 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
485 490 495 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg 
500 505 510 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
530 535 540 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 



1018 



Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys He Leu 
565 570 575 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
580 585 590 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
595 600 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
625 630 635 640 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
690 695 700 

Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 

820 825 830 

Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
835 840 845 

Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
850 855 860 



1019 



Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His 
865 870 875 880 

His 



<210> 561 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 561 

ggttgacttc agagctttga g 



<210> 562 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 562 

aaagctctga agtcaaccct ggaggaggcc ccctgg 



<210> 563 

<211> 2619 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 563 

^tgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 
ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 
cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 
atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 
ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 
gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 
caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 
^tggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 
gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 



1020 



ccgagactcg 


ccagaaatct 


cgcaataacg 


ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt ctggaaagca acctcaaaag gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt 


ctggtcggga 


cggactacaa 


tgagggtgtg 


720 


aagggtgtcg 


gcgtcaagaa 


ggctttgaac 


tacatcaaga 


cctacggaga 


tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac 


cacgtagagg 


agataaggaa 


tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata 


gagttcaggg 


agcctgactt 


tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aaccctggag 


gaggccccct 


ggcccccgcc 


ggaaggggcc 


1020 


ttcgtgggct 


tcgtcctctc 


ccgccccgag 


cccatgtggg 


cggagcttaa 


agccctggcc 


1080 


gcctgcaggg gcggccgcgt gcaccgggca gcagacccct tggcggggct 


aaaggacctc 


1140 


aaggaggtcc 


ggggcctcct 


cgccaaggac 


ctcgccgtct 


tggcctcgag ggaggggcta 


1200 


gacctcgtgc 


ccggggacga 


ccccatgctc 


ctcgcctacc 


tcctggaccc 


ttcgaacacc 


1260 


acccccgagg 


gggtggcgcg 


gcgctacggg 


ggggagtgga 


cggaggacgc 


cgcccaccgg 


1320 


gccctcctct 


cggagaggct 


ccatcggaac 


ctccttaagc 


gcctcgaggg 


ggaggagaag 


1380 


ctcctttggc 


tctaccacga 


ggtggaaaag 


cccctctccc 


gggtcctggc 


ccatatggag 


1440 


nr^r*^ nc rmnn 

y ^^«-*-^yyyy 




cgtggcctac 


cttcaggccc 


tttccctgga 


gcttgcggag 


1500 


gagatccgcc 


gcctcgagga 


ggaggtcttc 


cgcttggcgg gccacccctt 


caacctcaac 


1560 


tcccgggacc 


agctggaaag 


ggtgctcttt 


gacgagctta 


ggcttcccgc 


cttgaagaag 


1620 


acgaagaaga 


caggcaagcg 


ctccaccagc 


gccgcggtgc 


tggaggccct 


acgggaggcc 


1680 


caccccatcg 


tggagaagat 


cctccagcac 


cgggagctca 


ccaagctcaa 


gaacacctac 


1740 


gtggaccccc 


tcccaagcct 


cgtccacccg 


aggacgggcc 


gcctccacac 


ccgcttcaac 


1800 


cagacggcca 


cggccacggg 


gaggcttagt 


agctccgacc 


ccaacctgca 


gaacatcccc 


1860 


gtccgcaccc 


ccttgggcca 


gaggatccgc 


cgggccttcg 


tggccgaggc 


gggttgggcg 


1920 


ttggtggccc 


tggactatag 


ccagatagag 


ctccgcgtcc 


tcgcccacct 


ctccggggac 


1980 


gaaaacctga 


tcagggtctt 


ccaggagggg 


aaggacatcc 


acacccagac 


cgcaagctgg 


2040 


atgttcggcg 


tccccccgga 


ggccgtggac 


cccctgatgc 


gccgggcggc 


caagacggtg 


2100 


aacttcggcg 


tcctctacgg 


catgtccgcc 


cataggctct 


cccaggagct 


tgccatcccc 


2160 


tacgaggagg 


cggtggcctt 


tatagagcgc 


tacttccaaa 


gcttccccaa 


ggtgcgggcc 


2220 


tggatagaaa 


agaccctgga 


ggaggggagg 


aagcggggct 


acgtggaaac 


cctcttcgga 


2280 



1021 



2619 



agaaggcgct acgtgcccga cctcaacgcc cgggtgaaga gcgtcaggga ggccgcggag 2340 

cgcatggcct tcaacatgcc cgtccagggc accgccgccg acctcatgaa gctcgccatg 24 00 

gtgaagctct tcccccgcct ccgggagatg ggggcccgca tgctcctcca ggtcgccaac 2460 

gagctcctcc tggaggcccc ccaagcgcgg gccgaggagg tggcggcttt ggccaaggag 2520 

gccatggaga aggcctatcc cctcgccgtg cccctggagg tggaggtggg gatgggggag 2580 
gactggcttt ccgccaaggg tcaccaccac caccaccac 



<210> 564 

<211> 873 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 564 

Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 



1022 



Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala lie Thr Gly Lys 
180 185 190 



Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu lie 
195 200 205 

lie lie Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu lie Asp lie Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Leu Glu Glu Ala Pro Trp Pro Pro 
325 330 335 

Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met 
340 345 350 

Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His 
355 360 365 

Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg 
370 375 380 

Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu 
385 390 395 400 

Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp 
405 410 415 

Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu 
420 425 430 

Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His 
435 440 445 

Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu 
450 455 460 

Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu 
465 470 475 480 



1023 



Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu 
485 490 495 

Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu 
500 505 510 

Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val 
515 520 525 

Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr 
530 535 540 

Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala 
545 550 555 560 

His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu 
565 570 575 

Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr 
580 585 590 

Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg 
595 600 605 

Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro 
610 615 620 

LeJ Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala 
625 630 635 640 

Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His 
645 650 655 

Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp 
660 665 670 

He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala 
675 680 685 

Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val 
690 695 700 

Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro 
705 710 715 720 

Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro 
725 730 735 

Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg 
740 745 750 

Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu 
755 760 765 

Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe 
770 775 780 



1024 



Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met 

785 790 795 800 

Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu 

805 810 815 

Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu 

820 825 830 

Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu 

835 840 845 

Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser 



Ala Lys Gly His His His His His His 
865 870 

<210> 565 
<211> 2643 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 565 

^tgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 

ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 120 

cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 18 0 

atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 

ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 3 00 

gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 

caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 42 0 

^tggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 

gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 54 0 

ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 600 

tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 660 

acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 720 

sagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 7 80 

gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 84 0 

aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 900 
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atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aacccaggcc 


acgcttgaga ggtggttcct ggaggaggcc 


1020 


ccctggcccc 


cgccggaagg 


ggccttcgtg 


ggcttcgtcc 


tctcccgccc 


cgagcccatg 


1080 


tgggcggagc 


ttaaagccct 


ggccgcctgc 


aggggcggcc 


gcgtccaccg 


ggcccccgag 


1140 


ccttataaag 


ccctcaggga 


cctgaaggag 


gcgcgggggc 


ttctcgccaa 


agacctgagc 


1200 


gttctggccc 


tgagggaagg 


ccttggcctc 


ccgcccggcg 


acgaccccat 


gctcctcgcc 


1260 


tacctcctgg 


acccttcgaa 


caccaccccc 


gagggggtgg 


cccggcgcta 


cggcggggag 


1320 


tggacggagg 


aggcggggga 


gcgggccgcc 


ctttccgaga 


ggctcttcgc 


caacctgctt 


1380 


aagaggcttg 


agggggagga 


gaggctcctt 


tggctttacc 


gggaggtgga 


gaggcccctt 


1440 


tccgctgtcc 


tggcccatat ggaggccacg 


ggggtgcgcc 


tggacgtggc 


ctatctcagg 


1500 


gccttgtccc 


tggaggtggc 


cgaggagatc 


gcccgcctcg 


aggccgaggt 


cttccgcctg 


1560 


gccggccacc 


ccttcaacct 


caactcccgg 


gaccagctgg 


aaagggtcct 


ctttgacgag 


1620 


ctagggcttc 


ccgccatcaa 


gaagacgcaa 


aagaccggca 


agcgctccac 


cagcgccgcc 


1680 


gtcctggagg 


ccctccgcga 


ggcccacccc 


atcgtggaga 


agatcctgca 


gtaccgggag 


1740 


ctcaccaagc 


tgaagagcac 


ctacattgac 


cccttgccgg 


acctcatcca 


ccccaggacg 


1800 


ggccgcctcc 


acacccgctt 


caaccagacg 


gccacggcca 


cgggcaggct 


aagtagctcc 


1860 


gatcccaacc 


tccagaacat 


ccccgtccgc 


accccgcttg ggcagaggat 


ccgccgggcc 


1920 


ttcatcgccg 


aggaggggtg 


gctattggtg 


gccctggact 


atagccagat 


agagctcagg 


1980 


gtgctggccc 


acctctccgg 


cgacgagaac 


ctgatccggg 


tcttccagga 


ggggcgggac 


2040 


atccacacgg 


agaccgccag 


ctggatgttc 


ggcgtccccc 


gggaggccgt 


ggaccccctg 


2100 


atgcgccggg 


cggccaagac 


catcaacttc 


ggggtcctct 


acggcatgtc 


ggcccaccgc 


2160 


ctctcccagg 


agctagccat 


cccttacgag 


gaggcccagg 


ccttcattga gcgctacttt 


2220 


cagagcttcc 


ccaaggtgcg 


ggcctggatt 


gagaagaccc 


tggaggaggg 


caggaggcgg 


2280 


gggtacgtgg 


agaccctctt 


cggccgccgc 


cgctacgtgc 


cagacctaga 


ggcccgggtg 


2340 


aagagcgtgc 


gggaggcggc 


cgagcgcatg 


gccttcaaca 


tgcccgtcca 


gggcaccgcc 


2400 


gccgacctca 


tgaagctggc 


tatggtgaag 


ctcttcccca 


ggctggagga 


aatgggggcc 


2460 


aggatgctcc 


ttcaggtcgc 


caacgagctg 


gtcctcgagg 


ccccaaaaga 


gagggcggag 


2520 


gccgtggccc 


ggctggccaa 


ggaggtcatg 


gagggggtgt 


atcccctggc 


cgtgcccctg 


2580 


gaggtggagg tggggatagg ggaggactgg ctctccgcca aggagcacca 


, ccaccaccac 


2640 
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cac 



<210> 566 

<211> 881 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 566 

Met Gly Ala Asp lie Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys lie Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
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225 230 235 240 

Lys Gly Val Gly Val Lys hys Ala Leu Asn Tyr lie hys Thr Tyr Gly 
245 250 255 

Asp lie Phe Arg Ala Leu Lys Ala Leu Lys Val Asn lie Asp His Val 
260 265 270 

Glu Glu lie Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 

Ala Cys Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala 
370 375 380 

Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser 
385 390 395 400 

Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg 
435 440 445 

Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Leu Lys Arg Leu Glu 
450 455 460 

Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu 
465 470 475 480 

Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val 
485 490 495 

Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu He Ala Arg 
500 505 510 

Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro 
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530 



535 



540 



Ala lie Lys Lys Thr Gin Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 

Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie Leu 
565 570 575 

Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr lie Asp Pro Leu 
580 585 590 

Pro Asp Leu He His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
595 600 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
625 630 635 640 

Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
690 695 700 

Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly 
755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Val Leu 
820 825 830 



Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu 
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835 840 845 

Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
850 855 860 

Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu His His His His His 
865 870 875 880 

His 



<210> 567 

<211> 2643 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 567 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 



60 



ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 12 0 

cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 

atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 24 0 

ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 3 00 

gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 

caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 42 0 

atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 

gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 54 0 
ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 
tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 

acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 720 

aagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 78 0 
gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 
aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 
atcgagttcc tgtgcgagga gcacgacttc agcagggaga gggtcgagaa ggccttggag 960 

aagctcaaag ctctgaagtc aacccaggcc acgcttgaga ggtggttcct ggaggaggcc 102 0 

ccctggcccc cgccggaagg ggccttcgtg ggcttcgtcc tctcccgccc cgagcccatg 1080 

tgggcggagc ttaaagccct ggccgcctgc aggggcggcc gcgtccaccg ggcccccgag 114 0 



600 
660 



840 
900 
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ccttataaag ccctcaggga cctgaaggag gcgcgggggc ttctcgccaa agacctgagc 12 00 

gttctggccc tgagggaagg ccttggcctc ccgcccggcg acgaccccat gctcctcgcc 1260 

tacctcctgg acccttcgaa caccaccccc gagggggtgg cccggcgcta cggcggggag 1320 

tggacggagg aggcggggga gcgggccgcc ctttccgaga ggctcttcgc caacctgtgg 13 80 

gggaggcttg agggggagga gaggctcctt tggctttacc gggaggtgga gaggcccctt 1440 

tccgctgtcc tggcccatat ggaggccacg ggggtgcgcc tggacgtggc ctatctcagg 1500 

gccttgtccc tggaggtggc cgaggagatc gcccgcctcg aggccgaggt cttccgcctg 1560 

gccggccacc ccttcaacct caactcccgg gaccagctgg aaagggtcct ctttgacgag 1620 

ctagggcttc ccgccatcgg caagacggag aagaccggca agcgctccac cagcgccgcc 1680 

gtcctggagg ccctccgcga ggcccacccc atcgtggaga agatcctgca gtaccgggag 174 0 

ctcaccaagc tgaagagcac ctacattgac cccttgccgg acctcatcca ccccaggacg 1800 

ggccgcctcc acacccgctt caaccagacg gccacggcca cgggcaggct aagtagctcc 1860 

gatcccaacc tccagaacat ccccgtccgc accccgcttg ggcagaggat ccgccgggcc 192 0 

ttcatcgccg aggaggggtg gctattggtg gccctggact atagccagat agagctcagg 1980 

gtgctggccc acctctccgg cgacgagaac ctgatccggg tcttccagga ggggcgggac 2040 

atccacacgg agaccgccag ctggatgttc ggcgtccccc gggaggccgt ggaccccctg 2100 

atgcgccggg cggccaagac catcaacttc ggggtcctct acggcatgtc ggcccaccgc 2160 

ctctcccagg agctagccat cccttacgag gaggcccagg ccttcattga gcgctacttt 222 0 

cagagcttcc ccaaggtgcg ggcctggatt gagaagaccc tggaggaggg caggaggcgg 22 80 

gggtacgtgg agaccctctt cggccgccgc cgctacgtgc cagacctaga ggcccgggtg 234 0 

aagagcgtgc gggaggcggc cgagcgcatg gccttcaaca tgcccgtcca gggcaccgcc 2400 

gccgacctca tgaagctggc tatggtgaag ctcttcccca ggctggagga aatgggggcc 2460 

aggatgctcc ttcaggtcca caacgagctg gtcctcgagg ccccaaaaga gagggcggag 252 0 

gccgtggccc ggctggccaa ggaggtcatg gagggggtgt atcccctggc cgtgcccctg 2580 

gaggtggagg tggggatagg ggaggactgg ctctccgcca aggagcacca ccaccaccac 2 640 

cac 2643 

<210> 568 
<211> 881 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 568 

Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
1 5 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 
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Glu Glu lie Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg lie Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala lie Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 

Ala Cys Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala 
370 375 380 

Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser 
385 390 395 400 

Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg 
435 440 445 

Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu 
450 455 460 

Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu 
465 470 475 480 

Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val 
485 490 495 

Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu lie Ala Arg 
500 505 510 

Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro 
530 535 540 

Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 

Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys He Leu 
565 570 575 
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Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr He Asp Pro Leu 
580 585 590 

Pro Asp Leu He His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
595 600 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
625 630 635 640 

Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
690 695 700 

Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly 
755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asn Glu Leu Val Leu 
820 825 830 

Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu 
835 840 845 

Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
850 855 860 

Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu His His His His His 
865 870 875 880 
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His 



<210> 569 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 569 

gagcggataa caatttcaca cagg 24 



<210> 570 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 570 

tgcccggtgc acgcggccgc ccctgcaggc 3 0 



<210> 571 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 571 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 

atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 

ctggctaccc tggccaagaa ggcggaaaag gagggctacg aggtccgcat cctcaccgcc 42 0 

gacaaagacc tttaccagct cctttccgac cgcatccacg tcctccaccc cgaggggtac 480 

ctcatcaccc cggcctggct ttgggaaaag tacggcctga ggcccgacca gtgggccgac 540 

taccgggccc tgaccgggga cgagtccgac aaccttcccg gggtcaaggg catcggggag 60 0 
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840 
900 
960 
1020 
1080 
1140 



aagacggcga ggaagcttct ggaggagtgg gggagcctgg aagccctcct caagaacctg 660 

gaccggctga agcccgccat ccgggagaag atcctggccc acatggacga tctgaagctc 72 0 

tcctgggacc tggccaaggt gcgcaccgac ctgcccctgg aggtggactt cgccaaaagg 780 
cgggagcccg accgggaggg ggagaagccc cgggaggagg ccccctggcc cccgcccgaa 
ggggccttcg tgggcttcct cctttcccgc cccgagccca tgtgggcgga gcttaaagcc 
ctggccgcct gcaggggcgg ccgcgtgcac cgggcagcag accccttggc ggggctaaag 
gacctcaagg aggtccgggg cctcctcgcc aaggacctcg ccgtcttggc ctcgagggag 
gggctagacc tcgtgcccgg ggacgacccc atgctcctcg cctacctcct ggacccttcg 
aacaccaccc ccgagggggt ggcgcggcgc tacggggggg agtggacgga ggacgccgcc 

caccgggccc tcctctcgga gaggctccat cggaacctcc ttaagcgcct cgagggggag 1200 

gagaagctcc tttggctcta ccacgaggtg gaaaagcccc tctcccgggt cctggcccat 1260 

atggaggcca ccggggtacg gcgggacgtg gcctaccttc aggccctttc cctggagctt 1320 
gcggaggaga tccgccgcct cgaggaggag gtcttccgct tggcgggcca ccccttcaac 
ctcaactccc gggaccagct ggaaagggtg ctctttgacg agcttaggct tcccgccttg 
aagaagacga agaagacagg caagcgctcc accagcgccg cggtgctgga ggccctacgg 

gaggcccacc ccatcgtgga gaagatcctc cagcaccggg agctcaccaa gctcaagaac 1560 

acctacgtgg accccctccc aagcctcgtc cacccgagga cgggccgcct ccacacccgc 1620 

ttcaaccaga cggccacggc cacggggagg cttagtagct ccgaccccaa cctgcagaac 1680 

atccccgtcc gcaccccctt gggccagagg atccgccggg ccttcgtggc cgaggcgggt 174 0 

tgggcgttgg tggccctgga ctatagccag atagagctcc gcgtcctcgc ccacctctcc 1800 

ggggacgaaa acctgatcag ggtcttccag gaggggaagg acatccacac ccagaccgca 1860 

agctggatgt tcggcgtccc cccggaggcc gtggaccccc tgatgcgccg ggcggccaag 1920 

acggtgaact tcggcgtcct ctacggcatg tccgcccata ggctctccca ggagcttgcc 198 0 

atcccctacg aggaggcggt ggcctttata gagcgctact tccaaagctt ccccaaggtg 2040 

cgggcctgga tagaaaagac cctggaggag gggaggaagc ggggctacgt ggaaaccctc 2100 

ttcggaagaa ggcgctacgt gcccgacctc aacgcccggg tgaagagcgt cagggaggcc 2160 

gcggagcgca tggccttcaa catgcccgtc cagggcaccg ccgccgacct catgaagctc 2220 

gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 22 80 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 



1380 
1440 
1500 
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aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 24 00 
ggggaggact ggctttccgc caagggtcac caccaccacc accac 244 5 



<210> 572 

<211> 815 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 572 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 
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Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 



Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie Leu Gin His 
500 505 510 



Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 



1038 



Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu lie Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp lie His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala lie Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 

Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 

Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 
805 810 815 



<210> 573 
<211> 2520 
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<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 573 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 



ctcaccgccg accgcgacct ctaccaactc gtctccgacc gcatccacgt cctccacccc 
tcatcacccc ggagtggctt tgggagaagt atgggcttaa gccttcccag 540 



120 
180 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct caaggaggac 

ggggacgcgg tgatcgtggt ctttgacgcc aaggccccct ccttccgcca cgaggcctac 240 

ggggggtaca aggcgggccg ggcccccacc ccggaggact tcccccgcca gctcgccttg 3 00 

gtcaagcggc tggtggacct tctgggcctg gtccgcctcg aggccccggg gtacgaggcg 360 

gacgacgtcc tgggcaccct ggccaagaag gccgaaaagg aggggtacga ggtgcgcatc 420 



480 



600 



tgggtggact accgggcctt ggccggggac ccttccgaca acatccccgg cgtgaagggc 

atcggggaga agacggcggc caagctgatc cgggagtggg gaagcctgga aaacctcctc 660 

aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 72 0 

ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagagg cttagggcct ttctggagag gcttgagttt 84 0 

ggcagcctcc tccacgagtt cggccttctg gaaagcccca aggccctgga ggaggccccc 900 

tggcccccgc cggaaggggc cttcgtgggc ttcgtcctct cccgccccga gcccatgtgg 960 

gcggagctta aagccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 102 0 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 10 80 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 13 80 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 1440 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 150 0 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 
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ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 
accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 
cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 
cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 
gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 
ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 
cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 
cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 
tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 
agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 
tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 
agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 
gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 
atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 
gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 
gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 



<210> 574 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 574 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 
50 55 60 

He Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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65 



70 



75 



80 



Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg 
100 105 110 

Leu Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser 
180 185 190 

Asp Asn He Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys 
195 200 205 

Leu He Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
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370 



375 



380 



Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 



Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
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675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 575 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 575 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 
ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 
atggtggacc ttttgggctt tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 
ctggctaccc tggccaagaa ggcggaaaag gagggctacg aggtccgcat cctcaccgcc 42 0 
gacaaagacc tttaccagct cctttccgac cgcatccacg tcctccaccc cgaggggtac 



240 
300 
360 



480 
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ctcatcaccc 


cqqcctqqct 


ttgggaaaag 


tacggcctga 


ggcccgacca 


gtgggccgac 


540 




taccgggccc 


tgaccgggga 


cgagtccgac 


aaccttcccg 


gggtcaaggg 


catcggggag 


600 




aagacggcga 


ggaagcttct 


qqaqqaatqa 


qqqaqc c t qq 

333 ""3 ^ ^ 3 3 


aagccctcct 


caagaacctg 


660 




gaccggctga 


agcccgccat 


CCqqqaqaaq 
^^333 33 


atcctggccc 


acatggacga 


tctgaagctc 


720 




tcctgggacc 


tggccaaggt 


gcgcaccgac 


CtqccCCtqq 


aggtggactt 


cgccaaaagg 


780 




cqqqaqcccq 


accqqqaqqq 


gga-gaa-gccc 


c qqqaqq agq 
333 33 33 


ccccctggcc 


cccgcccgaa 


840 




ggggccttcg 


tgggcttcct 


cctttcccgc 


cccgagccca 


tgtgggcgga 


gcttaaagcc 


900 




ctqqccqcct 


qcaqqqqcqq 


ccgcgtgcac 


cqqqcaqcaq 

333 ,1-' --' 


accccttggc 


ggggctaaag 


960 




gacctcaagg 


a QQ t c c o Qcrq 

<^3 3 3 33 3 


cctcctcgcc 


aaggacctcg 


ccgtcttggc 


ct cqaqqqaq 


1020 




gg-gc t aga c c 


tcgfcgcccgg 


ggacgacccc 


atgctcctcg 


cctacctcct 


aaacccttcq 


1080 




a.B.csLCCciCcc 


c CQaqqaqot 

'^'-3"-^3::533^*- 


qqcqCQQCqc 


tacOQQqqqq 


aqtqqacqqa 


qqacqccqcc 

33"-*'^3'^"^3'^" 


1140 




caccgggccc 


tcctctcgga 


gaggctccat 


cggaacctcc 


ttaagcgcct 


cqaqqqqqaq 
3333333 


1200 




gagaagctcc 


tttggctcta 


ccacgaggtg 


gaaaagcccc 


tctcccqqqt 


cctggcccat 


1260 


o 


^tggaggcca 


ccggggtacg 


acQqqaCQtQ 


gcctaccttc 


aggccctttc 


cctggagctt 


1320 


H 1 


gcggaggaga 


tccgccgcct 


c a a era a cf cr a a 


gtcttccgct 




ccccttcaac 


1380 




ctcaactccc 


gggaccagct 


aoaaaaacrtQ 


ctctttgacg 


agcttaggct 


tcccgccttg 


1440 




aagaagacga 


aqaaqacaqq 

^^3 ^^^^3 ^^33 


caagcgctcc 


accagcgccg 


cqqtqctqqa 

^33 '-3 ^33"- 


qqccctacqq 


1500 




gaggcccacc 


ccatcgtgga 


gaagatcctc 


cagcaccggg 


agctcaccaa 


gctcaagaac 


1560 




acctacgtgg 


accccctccc 


aagcctcgtc 


cacccgagga 


cqaqccqcct 


ccacacccgc 


1620 




t~ t~ raa pcaaa 

^ V_r ^ fc*. 


c cf cr c c a c era p 


^civ^yy yy ciyy 


r* 1" t* a at" a ac t" 


ccaaccccaa 


cctgcagaac 


1680 




O. U. "s-. \^ y L-k^v^ 




yyyL-v—ciyciyy 


dt— u.L^yv.'t-yyy 




^y c^yy ^yy y 


1740 






L-yy ^(-L- cyy d 


\^ C-Ci L- Ay ocLy 


cL udy a.y v.^ u-^t^ 


y^^y I— ^^i_^y\j> 




1800 




^ y y ^ 3 dCJ-Cl 




yy n^i^v—v-'V^ ^y 


y *^yy yy *^*^y y 


CL I— ^ CI ^ 0. 


V— ' * — ' Ci ^-^ C*- V — V-^ CJ. 


1860 




agctgga tgt 


tcggcgtccc 


^ ^ ^ ^^^^ 3 ^ ^ 


y ^ ^ ^ ^ 


*- 3 3 ^3 ^^3 


aacaaccaaa 


1920 




a.cggtgaact 


tcggcgt cct 


ctacggcatg 


tccgcccafca 


ggctctccca 


ggagcfc tgcc 


1980 




atcccctacg 


aggaggcggt 


ggcctttata 


gagcgctact 


tccaaagctt 


ccccaaggtg 


2040 




cgggcctgga 


tagaaaagac 


cctggaggag 


gggaggaagc 


ggggctacgt 


ggaaaccctc 


2100 




ttcggaagaa 


ggcgctacgt 


gcccgacctc 


aacgcccggg 


tgaagagcgt 


cagggaggcc 


2160 




gcggagcgca 


tggccttcaa 


catgcccgtc 


cagggcaccg 


ccgccgacct 


catgaagctc 


2220 



1045 



2445 



gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 2280 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 

aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 2400 
ggggaggact ggctttccgc caagggtcac caccaccacc accac 



<210> 576 

<211> 815 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 576 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 150 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
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195 



200 



205 



Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys He Leu Gin His 
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500 



505 



510 



Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu lie Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp lie His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 

Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 



Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 
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805 810 815 

<210> 577 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 577 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


caaggaggac 


180 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


aaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca 


aggcgggccg 


ggcccccacc 


ccggaggact 


tcccccgcca 


gctcgccttg 


300 


gtcaagcggc 


tggtggacct 


tctgggcttt 


acccgcctcg 


aggccccggg 


gtacgaggcg 


360 


gacgacgtcc 


tgggcaccct 


ggccaagaag 


gccgaaaagg 


aggggtacga 


ggtgcgcatc 


420 


ctcaccgccg 


accgcgacct 


ctaccaactc 


gtctccgacc 


gcatccacgt 


cctccacccc 


480 


gaggggtacc 


tcatcacccc 


ggagtggctt 


tgggagaagt 


atgggcttaa 


gccttcccag 


540 


tgggtggact 


accgggcctt 


ggccggggac 


ccttccgaca 


acatccccgg 


cgtgaagggc 


600 


atcggggaga 


agacggcggc 


caagctgatc 


cgggagtggg 


gaagcctgga 


aaacctcctc 


660 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 


720 


ctgaagctct 


cctgggacct 


ggccaaggtg 


cgcaccgacc 


tgcccctgga 


ggtggacttc 


780 


gccaaaaggc 


gggagcccga 


ccgggagagg 


cttagggcct 


ttctggagag 


gcttgagttt 


840 


ggcagcctcc 


tccacgagtt 


cggccttctg 


gaaagcccca 


aggccctgga 


ggaggccccc 


900 


tggcccccgc 


cggaaggggc 


cttcgtgggc 


ttcgtcctct 


cccgccccga 


gcccatgtgg 


960 


gcggagctta 


aagccctggc 


cgcctgcagg 


ggcggccgcg 


tgcaccgggc 


agcagacccc 


1020 


ttggcggggc 


taaaggacct 


caaggaggtc 


cggggcctcc 


tcgccaagga 


cctcgccgtc 


1080 


ttggcctcga 


gggaggggct 


agacctcgtg 


cccggggacg 


accccatgct 


cctcgcctac 


1140 


ctcctggacc 


cttcgaacac 


cacccccgag 


ggggtggcgc 


ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg 


ggccctcctc 


tcggagaggc 


tccatcggaa 


cctccttaag 


1260 


cgcctcgagg 


gggaggagaa 


gctcctttgg 


ctctaccacg 


aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccatatgga 


ggccaccggg 


gtacggcggg 


acgtggccta 


ccttcaggcc 


1380 
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1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 144 0 
ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 
aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 
ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 
accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 
cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 
cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 
gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 
ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 
cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 
cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 
tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 
agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 
tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 
agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 22 80 
gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 234 0 
atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 
gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 
gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 



<210> 578 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 578 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 



2400 
2460 
2520 
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Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 
50 55 60 



lie Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg lie His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser 
180 185 190 

Asp Asn lie Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys 
195 200 205 

Leu lie Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 
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Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 



Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 
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Asp Pro Leu Met Arg Arg 
660 

Tyr Gly Met Ser Ala His 
675 

Glu Glu Ala Val Ala Phe 
690 

Val Arg Ala Trp lie Glu 
705 710 

Tyr Val Glu Thr Leu Phe 
725 

Ala Arg Val Lys Ser Val 
740 

Met Pro Val Gin Gly Thr 
755 

Lys Leu Phe Pro Arg Leu 
770 

Val Ala Asn Glu Leu Leu 
785 790 

Val Ala Ala Leu Ala Lys 
805 

Val Pro Leu Glu Val Glu 
820 

Lys Gly His His His His 
835 

<210> 579 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 579 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tctttgacgc cgaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 30 0 

^tggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 3 60 



Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
665 670 

Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
680 685 

lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
695 700 

Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
715 720 

Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
730 735 

Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
745 750 

Ala Ala Asp Leu Met Lys Leu Ala Met Val 
760 765 

Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
775 780 

Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
795 800 

Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
810 815 

Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
825 830 



His His 
840 



1053 





ctggctaccc 


tggccaagaa 


ggcggaaaag 


gagggctacg 


^.ggtccgcat 


cctcaccgcc 


420 




gacaaagacc 


tttaccagct 


cctttccgac 


cgcatccacg 


tcctccaccc 


cgaggggtac 


480 




ctcatcaccc 


cggcctggct 


ttgggaaaag 


tacggcctga 


ggcccgacca 


gtgggccgac 


540 




taccgggccc 


tgaccgggga 


cgagtccgac 


aaccttcccg 


gggtcaaggg 


catcggggag 


600 




aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcctgg 


aagccctcct 


caagaacctg 


660 




gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggacga 


tctgaagctc 


720 




tcctgggacc 


tggccaaggt 


gcgcaccgac 


ctgcccctgg 


aggtggactt 


cgccaaaagg 


780 




cgggagcccg 


accgggaggg 


ggagaagccc 


cgggaggagg 


ccccctggcc 


cccgcccgaa 


840 




ggggccttcg 


tgggcttcct 


cctttcccgc 


cccgagccca 


tgtgggcgga 


gcttaaagcc 


900 




ctggccgcct 


gcaggggcgg 


ccgcgtgcac 


cgggcagcag 


accccttggc 




960 




gacctcaagg 


aggtccgggg 


cctcctcgcc 


aaggacctcg 


ccgtcttggc 


ctcgagggag 


1020 




gggctagacc 


tcgtgcccgg 


ggacgacccc 


atgctcctcg 


cctacctcct 


ggacccttcg 


1080 


ff.. 


aacaccaccc 


ccgagggggt 


ggcgcggcgc 


tacggggggg 


^9tggacgga 


ggacgccgcc 


1140 




caccgggccc 


tcctctcgga 


gs99Ctccat 


cggaacctcc 


ttaagcgcct 


cgagggggag 


1200 


'3:1 V; 


gagaagctcc 


tttggctcta 


ccacgaggtg 


gaaaagcccc 


tctcccgggt 


cctggcccat 


1260 




atggaggcca 


ccggggtacg 


gcgggacgtg 


gcctaccttc 


aggccctttc 


cctggagctt 


1320 


u 


gcggaggaga 


tccgccgcct 


cgaggaggag 


gtcttccgct 


tggcgggcca 


ccccttcaac 


1380 




ctcaactccc 


gggaccagct 


ggaaagggtg 


ctctttgacg 


agcttaggct 


tcccgccttg 


1440 




aagaagacga 


agaagacagg 


caagcgctcc 


accagcgccg 


cggtgctgga 


ggccctacgg 


1500 




gaggcccacc 


ccatcgtgga 


gaagatcctc 


cagcaccggg 


agctcaccaa 


gctcaagaac 


1560 




acctacgtgg 


accccctccc 


aagcctcgtc 


cacccgagga 


cgggccgcct 


ccacacccgc 


1620 




ttcaaccaga 


cggccacggc 


cacggggagg 


cttagtagct 


ccgaccccaa 


cctgcagaac 


1680 




atccccgtcc 


gcaccccctt 


gggccagagg 


atccgccggg 


ccttcgtggc 


cgaggcgggt 


1740 




tgggcgttgg 


tggccctgga 


ctatagccag 


atagagctcc 


gcgtcctcgc 


ccacctctcc 


1800 




ggggacgaaa 


acctgatcag 


ggtcttccag 


gaggggaagg 


acatccacac 


ccagaccgca 


1860 




agctggatgt 


tcggcgtccc 


cc cggaggcc 


gtggaccccc 


tgatgcgccg 


ggcggccaag 


1920 




^cggtgaact 


tcggcgtcct 


ctacggcatg 


tccgcccata 


ggctctccca 


gg^gcttgcc 


1980 




atcccctacg 


aggaggcggt 


ggcctttata 


gagcgctact 


tccaaagctt 


ccccaaggtg 


2040 
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cgggcctgga tagaaaagac cctggaggag gggaggaagc ggggctacgt ggaaaccctc 2100 

ttcggaagaa ggcgctacgt gcccgacctc aacgcccggg tgaagagcgt cagggaggcc 2160 

gcggagcgca tggccttcaa catgcccgtc cagggcaccg ccgccgacct catgaagctc 2220 

gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 22 80 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 

aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 240 0 

ggggaggact ggctttccgc caagggtcac caccaccacc accac 2445 



<210> 580 

<211> 815 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 580 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 
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Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 
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Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
500 505 510 

Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 
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Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 



<210> 581 
<211> 2445 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 581 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 12 0 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tctttgacgc cgaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 

atggtggacc ttttgggctt tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 

ctggctaccc tggccaagaa ggcggaaaag gagggctacg aggtccgcat cctcaccgcc 420 

gacaaagacc tttaccagct cctttccgac cgcatccacg tcctccaccc cgaggggtac 480 

ctcatcaccc cggcctggct ttgggaaaag tacggcctga ggcccgacca gtgggccgac 54 0 

taccgggccc tgaccgggga cgagtccgac aaccttcccg gggtcaaggg catcggggag 600 

aagacggcga ggaagcttct ggaggagtgg gggagcctgg aagccctcct caagaacctg 660 

gaccggctga agcccgccat ccgggagaag atcctggccc acatggacga tctgaagctc 72 0 

tcctgggacc tggccaaggt gcgcaccgac ctgcccctgg aggtggactt cgccaaaagg 780 

cgggagcccg accgggaggg ggagaagccc cgggaggagg ccccctggcc cccgcccgaa 84 0 

ggggccttcg tgggcttcct cctttcccgc cccgagccca tgtgggcgga gcttaaagcc 900 

ctggccgcct gcaggggcgg ccgcgtgcac cgggcagcag accccttggc ggggctaaag 96 0 

gacctcaagg aggtccgggg cctcctcgcc aaggacctcg ccgtcttggc ctcgagggag 102 0 

gggctagacc tcgtgcccgg ggacgacccc atgctcctcg cctacctcct ggacccttcg 1080 

aacaccaccc ccgagggggt ggcgcggcgc tacggggggg agtggacgga ggacgccgcc 114 0 

caccgggccc tcctctcgga gaggctccat cggaacctcc ttaagcgcct cgagggggag 12 00 

gagaagctcc tttggctcta ccacgaggtg gaaaagcccc tctcccgggt cctggcccat 12 60 
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atggaggcca ccggggtacg gcgggacgtg gcctaccttc aggccctttc cctggagctt 1320 

gcggaggaga tccgccgcct cgaggaggag gtcttccgct tggcgggcca ccccttcaac 13 80 

ctcaactccc gggaccagct ggaaagggtg ctctttgacg agcttaggct tcccgccttg 1440 

aagaagacga agaagacagg caagcgctcc accagcgccg cggtgctgga ggccctacgg 150 0 

gaggcccacc ccatcgtgga gaagatcctc cagcaccggg agctcaccaa gctcaagaac 1560 

acctacgtgg accccctccc aagcctcgtc cacccgagga cgggccgcct ccacacccgc 1620 

ttcaaccaga cggccacggc cacggggagg cttagtagct ccgaccccaa cctgcagaac 1680 

atccccgtcc gcaccccctt gggccagagg atccgccggg ccttcgtggc cgaggcgggt 174 0 

tgggcgttgg tggccctgga ctatagccag atagagctcc gcgtcctcgc ccacctctcc 1800 

ggggacgaaa acctgatcag ggtcttccag gaggggaagg acatccacac ccagaccgca 1860 

agctggatgt tcggcgtccc cccggaggcc gtggaccccc tgatgcgccg ggcggccaag 192 0 

acggtgaact tcggcgtcct ctacggcatg tccgcccata ggctctccca ggagcttgcc 1980 

atcccctacg aggaggcggt ggcctttata gagcgctact tccaaagctt ccccaaggtg 2040 

cgggcctgga tagaaaagac cctggaggag gggaggaagc ggggctacgt ggaaaccctc 210 0 

ttcggaagaa ggcgctacgt gcccgacctc aacgcccggg tgaagagcgt cagggaggcc 2160 

gcggagcgca tggccttcaa catgcccgtc cagggcaccg ccgccgacct catgaagctc 2220 

gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 2280 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 

aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 2400 

ggggaggact ggctttccgc caagggtcac caccaccacc accac 2445 



<210> 582 

<211> 815 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 582 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 
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Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 



Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 
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Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg -Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
500 505 510 

Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 
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Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 

Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 

Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 
805 810 815 

<210> 583 
<211> 2520 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 583 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



60 
120 



ggccaccacc tggcctaccg tacctttttt gccctgaagg gcctcaccac cagccggggg 
gagccggtcc aggcggtgta cgggtttgcc aagagccttt tgaaggcgct aagagaagac 180 
ggggacgcgg tgatcgtggt ctttgacgcc gaggccccct ccttccgcca cgaggcctac 
ggggggtaca aggcggggcg ggctcccacc cccgaggact ttccccggca gcttgccctt 
atcaaggagc tggtggacct cctggggttt acccgcctcg aggtccccgg ctacgaggcg 
gacgacgttc tcgccaccct ggccaagaag gcggaaaagg aggggtacga ggtgcgcatc 
ctcaccgccg acaaagacct ttaccagctc ctttccgacc gcatccacgt cctccacccc 



240 
300 
360 
420 
480 
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gaggggtacc tcatcacccc ggcctggctt tgggaaaagt acggcctgag gcccgaccag 
tgggccgact accgggccct gaccggggac gagtccgaca accttcccgg ggtcaagggc 
atcggggaga agaccgccct caagctcctc aaggagtggg ggagcctgga agccctcctc 
aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 
ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 
gccaaaaggc gggagcccga ccgggagggg cttaaggcct ttttggagag gctggagttc 
ggcagcctcc tccacgagtt cggcctcctg ggaggggaga agccccggga ggaggccccc 
tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 
gccgatcttc tggccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 
ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 
ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 
ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 
acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 
cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 
cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 
ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 
ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 
aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 
ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 
accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 
cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 
cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 
gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 
ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 
cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 
cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2 04 0 
tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 
agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



2160 
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tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 22 80 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2 52 0 



<210> 584 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 584 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
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180 



185 



190 



Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 



1065 



485 



490 



495 



Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 



Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
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785 



790 795 800 



Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 585 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 585 

atgaattccc tgcccctctt tgagcccaag agccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gcacctcctt cgccctgaag ggcctcacca ccagccgcgg ggagccggtg 12 0 

cagatggtct acggcttcgc ccggagcctc ctcaaggcct tgaaggagga cggacaggcg 180 

gtggtcgtgg tctttgacgc caaggccccc tccttccgcc acgaggccta cgaggcctac 240 

aaggcgggcc gggcccccac cccggaggac ttccccggca gctcgccctt atcaaggaga 3 00 

tggtggacct tttgggcctg gcgcgcctcg aggtcccggg ctacgaggcg gacgacgttc 3 60 

tcgccaccct ggccaagaag gcggaaaagg aggggtacga ggtgcgcatt ctaccgccga 42 0 
ccgcgacctc taccaactcg tctccgaccg cgtcgccgtc ctccaccccg agggccacct 



480 



catcaccccg gagtggcttt gggagaagta cggcctcagg ccggagcagt gggtggactt 54 0 

ccgcgccctc gtgggggacc cctccgacaa cctccccggg gtcaagggca tcggggagga 60 0 

gacggcggcc aagctgatcc gggagtgggg aagcctggaa aaccttctta agcacctgga 660 

acaggcgaaa cctgcctccg tgcgggagaa gatccttagc cacatggagg acctcaagct 720 
atccctggag ctatcccggg tgcacacgga cttgcttctt cagtggactt taaggccctg 



780 



cgccgcagga cccccgacct ggagggcctg agggcctttt tggaggagct ggagttcggc 84 0 



900 
960 



agcctcctcc acgagttcgg cctcctggag gcccccgccg cggcggagga agctccctgg 
ccgccccccg agggagcctt cgtggggtac gttctttccc gccccgagcc catgtgggcg 

gagcttaacg ccttggccgc cgcctggggc ggccgcgtgc accgggcagc agaccccttg 102 0 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 114 0 
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ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggt 






2499 



<210> 586 

<211> 811 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 586 
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Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 



Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Gly Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Val Ser Asp Arg Val Ala Arg Pro Glu Gin Trp Val Asp 
145 150 155 160 

Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie Pro Gly Val Lys 
165 170 175 

Gly Xle Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp Gly Ser 
180 185 190 

Val Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala lie Arg 
195 200 205 

Glu Lys lie Leu Ala His Met Glu Asp Leu Lys Leu Ser Leu Glu Leu 
210 215 220 

Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp Leu Ala Gin Gly 
225 230 235 240 

Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe Leu Glu Arg Leu Glu 
245 250 255 

Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Val Ala 
260 265 270 

Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Tyr 
275 280 285 



Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Asn Ala Leu Ala 
290 295 300 
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Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
305 310 315 320 



Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
325 330 335 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 
340 345 350 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
355 360 365 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
370 375 380 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
385 390 395 400 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
405 410 415 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
420 425 430 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg 
435 440 445 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
450 455 460 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
465 470 475 480 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
485 490 495 

Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie Leu 
500 505 510 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
515 520 525 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
530 535 540 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
545 550 555 560 

Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala 
565 570 575 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
580 585 590 



lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu lie 
595 600 605 
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Arg Val Phe Gin Glu Gly Lys Asp lie His Thr Gin Thr Ala Ser Trp 
610 615 620 

Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
625 630 635 640 

Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
645 650 655 

Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu Ala Val Ala Phe He 
660 665 670 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp lie Glu Lys 
675 680 685 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
690 695 700 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
705 710 715 720 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
725 730 735 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
740 745 750 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 
755 760 765 

Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
770 775 780 

Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
785 790 795 800 

Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
805 810 

<210> 587 
<211> 2433 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 587 

atggaattcc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
ctggcctacc gcaccttcca cgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagagaaga cggggatgtg 
gtgatcgtgg tctttgacgc caaggccccc tccttccgcc acgaggccta cggggggtac 
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aaggcgggcc 


gggccccgac 


ccccgaggac 


ttcccccggc 


agctcgccct 


catcaaggag 


300 


ctggtggacc 


tcctggggct 


ggcgcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gagggctacg 


aggtgcgcat 


tctcaccggc 


420 


gaccgcgacc 


tttaccaact 


cgtctccgac 


cgcgtcgcca 


ggccggagca 


gtgggtggac 


480 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


540 


aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcgtgg 


aagccctcct 


caagaacctg 


600 


gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggagga 


cctcaagcta 


660 


tccctggagc 


tatcccgggt 


gcgcaccgac 


ctccccctgg 


aggtggacct 


cgcccagggg 


720 


cgggagcccg 


accgggaggg 


gcttaaggcc 


tttttggaga 


ggctggagtt 


cggaagcctc 


780 


ctccacgagt 


tcggcctgtt 


ggaaagcccg 


gtggcggcgg 


aggaagctcc 


ctggccgccc 


840 


cccgagggag 


ccttcgtggg 


gtacgttctt 


tcccgccccg 


agcccatgtg 


ggcggagctt 


900 


aacgccttgg 


ccgccgcctg 


gggcggccgc 


gtgcaccggg 


cagcagaccc 


cttggcgggg 


960 


ctaaaggacc 


tcaaggaggt 


ccggggcctc 


ctcgccaagg 


acctcgccgt 


cttggcctcg 


1020 


agggaggggc 


tagacctcgt 


gcccggggac 


gaccccatgc 


tcctcgccta 


cctcctggac 


1080 


ccttcgaaca 


ccacccccga 


gggggtggcg 


cggcgctacg 


ggggggagtg 


gacggaggac 


1140 


gccgcccacc 


gggccctcct 


ctcggagagg 


ctccatcgga 


acctccttaa 


gcgcctcgag 


1200 


9999^99^93. 


agctcctttg 


gctctaccac 


gaggtggaaa 


agcccctctc 


ccgggtcctg 


1260 


gcccatatgg 


aggccaccgg 


ggtacggcgg 


gacgtggcct 


accttcaggc 


cctttccctg 


1320 


gagcttgcgg 


aggagatccg 


ccgcctcgag 


gaggaggtct 


tccgcttggc 


gggccacccc 


1380 


ttcaacctca 


actcccggga 


ccagctggaa 


agggtgctct 


ttgacgagct 


taggcttccc 


1440 


gccttgaaga 


^gacgaagaa 


gacaggcaag 


cgctccacca 


gcgccgcggt 


gctggaggcc 


1500 


ctacgggagg 


cccaccccat 


cgtggagaag 


atcctccagc 


accgggagct 


caccaagctc 


1560 


aagaacacct 


acgtggaccc 


cctcccaagc 


ctcgtccacc 


cgaggacggg 


ccgcctccac 


1620 


acccgcttca 


accagacggc 


cacggccacg 


gggaggctta 


gtagctccga 


ccccaacctg 


1680 


cagaacatcc 


ccgtccgcac 


ccccttgggc 


cagaggatcc 


gccgggcctt 


cgtggccgag 


1740 


gcgggttggg 


cgttggtggc 


cctggactat 


agccagatag 


agct ccgcgt 


cct cgcccac 


1 R n n 


ctctccgggg 


acgaaaacct 


gatcagggtc 


ttccaggagg 


ggaaggacat 


ccacacccag 


1860 


accgcaagct 


ggatgttcgg 


cgtccccccg 


gaggccgtgg 


accccctgat 


gcgccgggcg 


1920 


gccaagacgg 


tgaacttcgg 


cgtcctctac 


ggcatgtccg 


cccataggct 


ctcccaggag 


1980 
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2160 
2220 
2280 
2340 
2400 
2433 



cttgccatcc cctacgagga ggcggtggcc tttatagagc gctacttcca aagcttcccc 2 04 0 
aaggtgcggg cctggataga aaagaccctg gaggagggga ggaagcgggg ctacgtggaa 2100 
accctcttcg gaagaaggcg ctacgtgccc gacctcaacg cccgggtgaa gagcgtcagg 
gaggccgcgg agcgcatggc cttcaacatg cccgtccagg gcaccgccgc cgacctcatg 
aagctcgcca tggtgaagct cttcccccgc ctccgggaga tgggggcccg catgctcctc 
caggtcgcca acgagctcct cctggaggcc ccccaagcgc gggccgagga ggtggcggct 
ttggccaagg aggccatgga gaaggcctat cccctcgccg tgcccctgga ggtggaggtg 
gggatggggg aggactggct ttccgccaag ggt 



<210> 588 

<211> 811 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 588 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Gly Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Val Ser Asp Arg Val Ala Arg Pro Glu Gin Trp Val Asp 
145 150 155 160 
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Tvr Arq Ala Leu Ala Gly Asp Pro Ser Asp Asn He Pro Gly Val Lys 
165 170 175 

Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp Gly Ser 
180 185 190 

Val Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala He Arg 
195 200 205 

Glu Lys He Leu Ala His Met Glu Asp Leu Lys Leu Ser Leu Glu Leu 
210 215 220 

Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp Leu Ala Gin Gly 
225 230 235 240 

Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe Leu Glu Arg Leu Glu 
245 250 255 

Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Val Ala 
260 265 270 

Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Tyr 
275 280 - 285 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Asn Ala Leu Ala 
290 295 300 

Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
305 310 315 320 

Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
325 330 335 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 
340 345 350 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
355 360 365 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
370 375 380 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
385 390 395 400 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
405 410 415 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
420 425 430 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg 
435 440 445 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
450 455 460 
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Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
465 470 475 480 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
485 490 495 

Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu 
500 505 510 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
515 520 525 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
530 535 540 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
545 550 555 560 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
565 570 575 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
580 585 590 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
595 600 605 

Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp 
610 615 620 

Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
625 630 635 640 

Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
645 650 655 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He 
660 665 670 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
675 680 685 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
690 695 700 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
705 710 715 720 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
725 730 735 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
740 745 750 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 
755 760 765 
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Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
770 775 780 

Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
785 790 795 800 

Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
805 810 

<210> 589 
<211> 2493 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 589 





atgaattccc 


tgcccctctt 


tgagcccaag ggccgggtcc 


tcctggtgga 


cggccaccac 


60 




ctggcctacc 


gtaccttttt 


tgccctgaag ggcctcacca 


cctcccgggg 


ggagccggtg 


120 




cagatggtct 


acggcttcgc 


ccggagcctc 


ctcaaggccc 


tcaaggagga 


cggggacgcg 


180 




gtgatcgtgg 


tctttgacgc 


cgaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 




aaggcgggga 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


s 


ctggtggacc 


tcctggggtt 


tacccgcctc 


gaggtccccg 


gctacgaggc 


ggacgacgtt 


360 




ctcgccaccc 


tggccaagaa 


ggcggaaaag 


gagggctacg 


aggtgcgcat 


cctcaccgcg 


420 




gaccgggacc 


tttaccagct 


tctttccgac 


cgcattcacg 


tccttcaccc 


cgaggggtac 


480 




ctcatcaccc 


cggcctggct 


ttgggaaaag 


tacggcttga 


ggcccgacca 


gtgggccgac 


540 




taccgggccc 


tgaccgggga 


cgaatccgac 


aacctttccg 


gggtcaaggg 


catcggggag 


600 




aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcctgg 


aagccctcct 


caagaacctg 


660 




gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggacga 


tctgaagctc 


720 




tccttggagc 


tctcccgggt 


gcgcaccgac 


ctccccctgg 


aggtggactt 


cgccaaaagg 


780 




cgggagcccg 


accgggagag 


gcttagggcc 


tttctggaga 


ggcttgagtt 


tggcagcctc 


840 




ctccacgagt 


tcggcccttt 


ggaaagcccc 


agggccgcgg 


aggaagctcc 


ctggccgccc 


900 




cccgagggag 


ccttcgtggg 


gtacgttctt 


tcccgccccg 


agcccatgtg 


ggcggagctt 


960 




aacgccttgg 


ccgccgccag 


gggcggccgc 


gtgcaccggg 


cagcagaccc 


cttggcgggg 


1020 




ctaaaggacc 


tcaaggaggt 


ccggggcctc 


ctcgccaagg 


acctcgccgt 


cttggcctcg 


1080 




agggaggggc 


tagacctcgt 


gcccggggac 


gaccccatgc 


tcctcgccta 


cctcctggac 


1140 
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ccttcgaaca ccacccccga gggggtggcg cggcgctacg ggggggagtg gacggaggac 12 0 0 

gccgcccacc gggccctcct ctcggagagg ctccatcgga acctccttaa gcgcctcgag 1260 

ggggaggaga agctcctttg gctctaccac gaggtggaaa agcccctctc ccgggtcctg 132 0 

gcccatatgg aggccaccgg ggtacggcgg gacgtggcct accttcaggc cctttccctg 1380 

gagcttgcgg aggagatccg ccgcctcgag gaggaggtct tccgcttggc gggccacccc 144 0 

ttcaacctca actcccggga ccagctggaa agggtgctct ttgacgagct taggcttccc 1500 

gccttgaaga agacgaagaa gacaggcaag cgctccacca gcgccgcggt gctggaggcc 1560 

ctacgggagg cccaccccat cgtggagaag atcctccagc accgggagct caccaagctc 162 0 

aagaacacct acgtggaccc cctcccaagc ctcgtccacc cgaggacggg ccgcctccac 1680 

acccgcttca accagacggc cacggccacg gggaggctta gtagctccga ccccaacctg 174 0 

cagaacatcc ccgtccgcac ccccttgggc cagaggatcc gccgggcctt cgtggccgag 1800 

gcgggttggg cgttggtggc cctggactat agccagatag agctccgcgt cctcgcccac 1860 

ctctccgggg acgaaaacct gatcagggtc ttccaggagg ggaaggacat ccacacccag 192 0 

accgcaagct ggatgttcgg cgtccccccg gaggccgtgg accccctgat gcgccgggcg 1980 

gccaagacgg tgaacttcgg cgtcctctac ggcatgtccg cccataggct ctcccaggag 2 040 

cttgccatcc cctacgagga ggcggtggcc tttatagagc gctacttcca aagcttcccc 2100 

aaggtgcggg cctggataga aaagaccctg gaggagggga ggaagcgggg ctacgtggaa 2160 

accctcttcg gaagaaggcg ctacgtgccc gacctcaacg cccgggtgaa gagcgtcagg 2220 

gaggccgcgg agcgcatggc cttcaacatg cccgtccagg gcaccgccgc cgacctcatg 2280 

aagctcgcca tggtgaagct cttcccccgc ctccgggaga tgggggcccg catgctcctc 234 0 

caggtcgcca acgagctcct cctggaggcc ccccaagcgc gggccgagga ggtggcggct 2400 

ttggccaagg aggccatgga gaaggcctat cccctcgccg tgcccctgga ggtggaggtg 246 0 

gggatggggg aggactggct ttccgccaag ggt 2493 



<210> 590 

<211> 831 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 590 
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Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe Ala Arg 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Ser Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Pro Leu Glu 
275 280 285 

Ser Pro Arg Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 
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Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Asn Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Ala Asp 
325 330 335 

Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro 
355 360 365 

Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
385 390 395 400 

Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu 
405 410 415 

Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val 
435 440 445 

Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu 
450 455 460 

Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser 
500 505 510 

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val 
515 520 525 

Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu 
595 600 605 
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Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 



Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala 
675 680 685 

Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala 
690 695 700 

Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu 
705 710 715 720 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 
725 730 735 

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe 
755 760 765 

Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn 
770 775 780 

Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala 
785 790 795 800 

Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu 
805 810 815 

Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

<210> 591 
<211> 23 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 591 

agaaaggaag ggaagaaagc gaa 



<210> 592 
<211> 45 
<212> DNA 



1080 



<213> Artificial 
<220> 

<223> Synthetic 
<400> 592 

tggaggtcaa aacatcgata agtcgaagaa aggaagggaa gaaat 



<210> 593 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 593 
tgttttgacc tcca 



<210> 594 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 594 

ggcgaccaca cccgtcctgt 

<210> 595 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 595 

ccacgatgcg tccggcgtag 



<210> 596 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 596 

acgggtcaat gtccatgccc caaaga 
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<210> 597 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied__base 

<222> (23) . . (23) 

<223> /note= "The residue at this position indicates 2' o-methyl sugar" 



<400> 597 

gtctgagatg aaagtgctcc cgcacccacc caaggcacag c 41 



<210> 598 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 598 

tcttcgcaca tttcatctca gacgga 26 



<210> 599 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<222> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (17) 

<223> /note= "The residues at these positions are 2' o-methyl sugars" 



<400> 599 

gctgtgcctt gggtggg 17 



<210> 600 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<220> 

<221> modif ied__base 

<222> (1) . . (21) 

<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 600 

gctgtgcctt gggtgggtgc g 21 



<210> 601 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 601 

aacgaggcgc acccacccaa ggcacagc 2 8 



<210> 602 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (23) . , (27) 

<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 602 

gtctgagatg aaagtgcgcc tcgttaa 2 7 



<210> 503 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 603 

aacgaggcgc acccacccaa ggcacagc 2 8 



<210> 604 

<211> 23 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 604 

tcttcgcaca tttcatctca gac 



<210> 605 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 605 

gctgtgcctt gggtgggtgc g 



<210> 606 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 606 

tcttcgcaca tttcatctca gacgga 



<210> 607 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 607 

gctgtgcctt gggtgggtgc 



<210> 608 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (15) . , (20) 

<223> /note= "The residues at these positions are 2' o-methyl sugars 
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<400> 608 

gctgtgcctt gggtgggtgc 



<210> 609 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (20) 

<223> /note= "The residues at these 



<400> 609 

gctgtgcctt gggtgggtgc 



<210> 610 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 610 

acgggtcaat gtccatgccc caaaga 

<210> 611 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 611 

aacgaggcgc acccacccaa ggcacagc 

<210> 612 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (23) . - (27) 



positions are 2' o-methyl sugars" 



20 



26 
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<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 612 

gtctgagatg aaagtgcgcc tcgttaa 27 



<210> 613 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> THodif ied_base 

<222> (1) . . (17) 

<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 613 
gctgtgcctt gggtggg 



<210> 614 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (19) 

<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 614 

gctgtgcctt gggtgggtg 



<210> 615 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (21) 

<223> /note= "The residues at these positions are 2' o-methyls" 
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<400> 615 

gctgtgcctt gggtgggtgc g 



<210> 616 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (1)..(22) 

<223> /note= "The residues at these positions are 2' o-me 



<400> 616 

gctgtgcctt gggtgggtgc gc 



<210> 617 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 617 

ccgtcaacat ttaccatggg tgcgga 



<210> 618 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 618 

ccgccacctc gtagtcgaca tccttttcgt g 



31 



<210> 619 

<211> 29 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 619 

ttgataaatt tggggtggaa aggtttgga 
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<210> 620 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 620 
agaactgaga ggaggcg 



<210> 621 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 621 

aacgaggcgc accaaactca ctcat 



<210> 


622 


<211> 


35 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


622 



gtcatgtagg cttctatgta gttgatgaag atgta 



<210> 623 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 623 

ggctttgtag atgcctttct cttgga 



<210> 624 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



1088 



<400> 624 
atgagtgagt ttggtgcg 



<210> 625 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 625 

ccaggaagca agtggtgcgc ctcgttt 



<210> 626 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 626 

aacgaggcgc accttggagg ca 



<210> 627 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 627 

aaggtttcct tctcagttgt gtta 



<210> 628 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 628 

gcaaagatgt ctgttacggt caactc 



<210> 629 

<211> 16 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 

<400> 629 
tgcctccaag gtgcgc 



<210> 630 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 630 

aacgaggcgc accttcaaaa tgcctaa 



<210> 631 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 631 

tgtcactctc ctctttccaa tta 



<210> 632 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 632 

gaaaagagtt ccattatccg ctacatctg 



<210> 633 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 633 

ttaggcattt tgaaggtgcg c 



<210> 634 
<211> 23 



<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 634 

aacgaggcgc accgttgtgt ccc 



<210> 635 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 635 

gggatgtaga agccattcag a 



<210> 636 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 636 

ttgttgtgct gtgggggatg 



<210> 637 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 
<400> 637 

17 

gggacacaac ggtgcgc 



<210> 638 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 638 

1 3 

ccgtcacgcc tccaccatat ccc 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 639 

ccagcggttt ccattggcaa agat 



<210> 640 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 640 

cggaagaatg ggtcgaccat g 



<210> 641 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 641 
gggatatggt ggaggcg 

<210> 642 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 642 

aacgaggcgc accgttccag go 



<210> 643 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 



109Z 



<400> 643 

catatccatg cagcaccacc atga 



24 



<210> 644 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 644 

caaaatacag agtgaacaca gggcc 



<210> 645 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 645 
gcctggaacg gtgcgc 



<210> 646 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 646 

ccgtcacgcc tcatggataa tgccc 



<210> 647 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 647 

caggtgagaa aaggcattac agatagtgaa age 



<210> 648 

<211> 22 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 648 

cagaggaaag agagctgcag gg 



<210> 649 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 649 

gggcattatc catgaggcg 



<210> 650 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 650 

ccgtcacgcc tccctgctga gaaa 



<210> 651 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 651 

cccgaggcat gcacggcgga 



<210> 652 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 652 
ggcaggaagg cctcc 



<210> 653 
<211> 18 



<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 653 
tttctcagca gggaggcg 



<210> 654 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 654 

ccgtcacgcc tcgccccaca 



<210> 655 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 655 

cagcacaggc tgttgaccat cataaaac 



<210> 656 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 656 

cttttccata ctttttatga cattc 



<210> 657 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 657 
tgtggggcga ggcg 



<210> 658 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 658 

aacgaggcgc acagttgacc ttc 



<210> 659 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 659 

gtgatggcca gcacagggc 



<210> 660 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 660 

atacgttccc cacatttttc 



<210> 661 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 661 
tgaaggtcaa ctgtgcgc 



<210> 662 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 662 

aacgaggcgc acgtcataaa tacccc 



26 



<210> 663 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 663 

gccagcatag gctgttgaca c 21 



<210> 664 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 664 

agacttttct atacttttta taacattc 28 



<210> 665 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 665 

ggggtattta tgacgtgcgc 20 



<210> 666 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 666 

ccgtcacgcc tcctgtctgt gat 23 



<210> 667 

<211> 22 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 667 

tcctgacagt gctcaatcag ga 



<210> 668 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 668 

tcctgacaat gctcaatgag ga 



<210> 669 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 669 
gtcccggatg tggccc 

<210> 670 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 670 
atcacagaca ggaggcg 



<210> 671 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 671 

aacgaggcgc acggactgtt ttctgc 



<210> 672 
<211> 27 



<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 672 

cttgtcaaag tcctgatagt gctcctc 



<210> 673 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 673 

cttgttgaag tcttgatagt gttcctc 



<210> 674 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 674 

gcagaaaaca gtccgtgcgc 



<210> 675 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 675 

ccgtcacgcc tcactgcggt cat 



<210> 676 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 676 

gtggataact gcatcagtgt atggcatttt c 
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<210> 677 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 677 

caagggttgg tagcctgtgt gagcc 



<210> 678 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 678 
atgaccgcag tgaggcg 



<210> 679 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 679 

ccgtcacgcc tcagagccaa tcac 



<210> 680 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 680 

cgatcatcaa gggatggtgg cctgtgc 



<210> 681 

<211> 31 

<212> DNA 

<213> Artificial 



<220> 

<22 3> Synthetic 



<400> 681 

ctgatcaatc tccttttgga ctttctctgc g 
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<210> 682 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 682 
gtgattggct ctgaggcg 



<210> 683 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 683 

ccgtcacgcc tcctcttcaa tttctg 



<210> 684 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 684 

ccctgtcaat ttcttcatga agttta 



<210> 685 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 685 

ggtatttcat gaggatcagg age 



<210> 686 

<211> 20 

<212> DNA 

<213> Artificial 



1101 



<220> 

<223> Synthetic 
<400> 686 

cagaaattga agaggaggcg 



<210> 687 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<22 3> Synthetic 

<400> 687 

aacgaggcgc accgggtccc a 



<210> 688 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 688 

tcccctgttt cttgaaaagt ccatgtgtga 



<210> 689 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 689 

aatccgtaga ggagcaccag g 



<210> 690 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 690 
tgggacccgg tgcgc 



<210> 691 
<211> 21 



<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 691 

ccgtcacgcc tcctcggcag g 



<210> 692 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 692 

cacaatatcg taggtaggag gtgccttaa 



<210> 693 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 693 
gccccatcga tctcctcc 



<210> 694 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 694 
cctgccgagg aggcg 



<210> 695 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 695 

aacgaggcgc actaggcttt get 



<210> 696 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 696 

ttcatgtagt cagggtcata gacaattaag 



<210> 697 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 697 

tccccagaac catcgaggaa agg 



<210> 698 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 698 
agcaaagcct agtgcgc 



<210> 699 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 699 

aacgaggcgc acagaaggcc cctt 



<210> 700 

<211> 31 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 700 

ccttgaacag caccagaaat agactgagca 



<210> 701 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 701 

ggaagaaccc agagacacca tec 



<210> 702 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 702 
aaggggcctt ctgtgcgc 



<210> 703 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 703 

aacgaggcgc acgttgtgat acctt 



<210> 704 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 704 

gatgaaggcc ataaattaaa attgtgc 



<210> 705 

<211> 16 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 705 
tgggtatgga acgtcc 



<210> 706 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 706 

aaggtatcac aacgtgcgc 



<210> 707 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 707 
cccccttttg gggg 



<210> 708 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 708 

ccctatcttt aaagttttta aaaagtttga 



